The data (e)speaks: Difference between revisions
From Algolit
m |
|||
(16 intermediate revisions by 3 users not shown) | |||
Line 3: | Line 3: | ||
| Type: || Algoliterary exploration | | Type: || Algoliterary exploration | ||
|- | |- | ||
− | | Datasets: || | + | | Datasets: || [[Frankenstein]], [[Learning from Deep Learning]], [[nearbySaussure]], [[astroBlackness]] |
|- | |- | ||
| Technique: || espeak | | Technique: || espeak | ||
Line 10: | Line 10: | ||
|} | |} | ||
+ | In the process of making the Algolit datasets, careful consideration was given to the selection of the source texts. Our attempt was to have a variety of tone of voices that highlights the heterogeneity of all of them combined. | ||
+ | |||
+ | The texts were gathered from aaaaarg.fail, gen.lib.rus.ec, archive.org and gutenberg.org, processed with terminal commands such as [https://en.wikipedia.org/wiki/Pdftotext pdftotext] in order to generate .txt files and stripped of punctuation marks with the help of [https://gitlab.constantvzw.org/algolit/algolit/blob/master/algoliterary_encounter/algoliterary-toolkit/text-punctuation-clean-up.py a Python code snippet]. | ||
+ | |||
+ | The ensuing datasets are: | ||
+ | * [[Frankenstein]] | ||
+ | * [[Learning from Deep Learning]] | ||
+ | * [[nearbySaussure]] | ||
+ | * [[astroBlackness]] | ||
+ | |||
+ | ''The data (e)speaks'' is an audio installation that reads out specific sentences from their text body. | ||
+ | |||
+ | <small>Dataset: '''[[astroBlackness]]'''. Source: '''Octavia E. Butler'''. Author: '''Gerry Canavan'''. Publisher: '''University of Illinois Press'''. Year: '''2016'''. Page: '''145'''. Paragraph: '''3'''.</small><br> | ||
+ | ''She returned over and over again to world-building aspects of the narrative — the social organization of the colony, the physics governing Bow, the possible organisms that might live in such a place, Earth-historical parallels to their situation—but could never get the plot itself to gel.'' | ||
+ | |||
+ | <small>Dataset: '''[[astroBlackness]]'''. Source: '''Octavia E. Butler'''. Author: '''Gerry Canavan'''. Publisher: '''University of Illinois Press'''. Year: '''2016'''. Page: '''186'''. Paragraph: '''3'''.</small><br> | ||
+ | ''More important than technique, however, is for authors to remember that they are writing about people. Authors who forget this, who do not relax and get comfortable with their racially different characters, can wind up creating unbelievable, self-consciously manipulated puppets; pieces of furniture who exist within a story but contribute nothing to it; or stereotypes guaranteed to be offensive.'' | ||
+ | |||
+ | <small>Dataset: '''[[astroBlackness]]'''. Source: '''Octavia E. Butler'''. Author: '''Gerry Canavan'''. Publisher: '''University of Illinois Press'''. Year: '''2016'''. Page: '''46'''. Paragraph: '''5'''.</small><br> | ||
+ | ''The sole competitor to the unchallenged hegemony of the Patternists, beyond the social instability implied by their own endless internal struggle for dominance, comes in the form of the Clayarks, mutated humans sick from an interplanetary virus, who are partially immune to Patternist control and who exist in the wild spaces between Patternist enclaves.'' | ||
+ | |||
+ | <small>Dataset: '''[[astroBlackness]]'''. Source: '''Social Text - Afrofuturist Issue'''. Author: '''Tracie Morris'''. Publisher: '''Duke University Press'''. Year: '''2002'''. Page: '''93'''. Paragraph: '''1'''.</small><br> | ||
+ | ''My first word was an error<br>'' | ||
+ | ''according to the machine I spoke it in.<br>'' | ||
+ | ''Whispering into an orifice used to be intimate,<br>'' | ||
+ | ''Now, the Neural Network Noir twitters from every misplaced<br>'' | ||
+ | ''exclamation. Deep spell check.'' | ||
+ | |||
+ | <small>Dataset: '''[[astroBlackness]]'''. Source: '''The Immeasurable Equation'''. Author: '''Sun Ra'''. Publisher: '''Hartmut Geerken'''. Year: '''2005'''. Page: '''306'''.</small><br> | ||
+ | ''· = aim<br/>'' | ||
+ | ''· = end<br/>'' | ||
+ | ''· = period<br/>'' | ||
+ | ''· = time<br/>'' | ||
+ | ''· = era<br/>'' | ||
+ | ''· = age<br/>'' | ||
+ | ''· = cycle<br/>'' | ||
+ | |||
+ | <small>Dataset: '''[[nearbySaussure]]'''. Source: '''The Cambridge Companion to Saussure'''. Author: '''Peter Wunderli'''. Publisher: '''Cambridge University Press'''. Year: '''2006'''. Page: '''21'''. Paragraph: '''3'''.</small><br> | ||
+ | ''All these concessions open the door to coincidence and finally turn the anagram into a phenomenon of probability. In the end one is left wondering whether it is not possible to extract any word out of any text of a certain length.'' | ||
− | + | <small>Dataset: '''[[nearbySaussure]]'''. Source: '''Saussure, Derrida, and the Metaphysics of Subjectivity'''. Author: '''Robert M. Strozier'''. Publisher: '''Mouton de Gruyter'''. Year: '''1988'''. Page: '''21'''. Paragraph: '''3'''.</small><br> | |
+ | ''“the linguist's task is not to study utterances for their own sake … but to provide evidence about the nature of the underlying system, the English language”'' | ||
+ | |||
+ | <small>Dataset: '''[[nearbySaussure]]'''. Source: '''Saussure, Derrida, and the Metaphysics of Subjectivity'''. Author: '''Robert M. Strozier'''. Publisher: '''Mouton de Gruyter'''. Year: '''1988'''. Page: '''49'''. Paragraph: '''2'''.</small><br> | ||
+ | ''It is the point of view, as Saussure says, which determines the fact or entity--the object--within the mass of data. “Point of view” is a technical term for Saussure: it is another term for form; that is, the form represents what is induced upon the mass of data, now considered as a material capable of accepting the various forms.'' | ||
+ | <small>Dataset: '''[[nearbySaussure]]'''. Source: '''Saussure's Philosophy of Language as Phenomenology'''. Author: '''Beata Stawarska'''. Publisher: '''Oxford University Press'''. Year: '''2015'''. Page: '''43'''. Paragraph: '''1'''.</small><br> | ||
+ | ''Temporality proper to language itself intersects the signifier and the signified in a relation of mutual interdependency; this naturally makes the ideas themselves subject to mutation and change. Importantly, the source materials do not support the structuralist notion of a fixed and immutable system; such a notion emerges rather as the philosophers’ abstraction removed from the concrete reality of language in usage;'' | ||
− | + | <small>Dataset: '''[[Learning from Deep Learning]]'''. Source: '''[https://deeplearning4j.org/word2vec#embed Deep Learning for Java]'''. Author: '''Skymind'''.</small><br> | |
+ | ''The vectors we use to represent words are called neural word embeddings, and representations are strange. One thing describes another, even though those two things are radically different. As Elvis Costello said: “Writing about music is like dancing about architecture”.'' | ||
+ | <small>Dataset: '''[[Learning from Deep Learning]]'''. Source: '''Deep Learning'''. Author: '''Ian Goodfellow and Yoshua Bengio and Aaron Courville'''. Page: '''112'''. Paragraph: '''1'''.</small><br> | ||
+ | ''In practice, the learning algorithm does not actually find the best function, but merely one that significantly reduces the training error. These additional limitations, such as the imperfection of the optimization algorithm, mean that the learning algorithm’s effective capacity may be less than the representational capacity of the model family.'' | ||
− | The | + | <small>Dataset: '''[[Learning from Deep Learning]]'''. Source: '''Deep Learning'''. Author: '''Ian Goodfellow and Yoshua Bengio and Aaron Courville'''. Page: '''472'''. Paragraph: '''3'''.</small><br> |
+ | ''The idea of forming an embedding for a symbol was extended to the idea of an embedding for a word by Deerwester et al. (1990). These embeddings were learned using the SVD. Later, embeddings would be learned by neural networks.'' | ||
+ | <small>Dataset: '''[[Frankenstein]]'''. Source: '''Frankenstein'''. Author: '''Mary Shelley'''.</small><br> | ||
+ | ''My education was neglected, yet I was passionately fond of reading.'' | ||
− | '' | + | <small>Dataset: '''[[Frankenstein]]'''. Source: '''Frankenstein'''. Author: '''Mary Shelley'''.</small><br> |
+ | ''The different accidents of life are not so changeable as the feelings of human nature.'' | ||
+ | <small>Dataset: '''[[Frankenstein]]'''. Source: '''Frankenstein'''. Author: '''Mary Shelley'''.</small><br> | ||
+ | ''Invention, it must be humbly admitted, does not consist in creating out of void, but out of chaos; the materials must, in the first place be afforded: it can give form to dark shapeless substances but cannot bring into being the substance itself.'' | ||
[[Category:Algoliterary-Encounters]] | [[Category:Algoliterary-Encounters]] |
Latest revision as of 19:06, 1 November 2017
Type: | Algoliterary exploration |
Datasets: | Frankenstein, Learning from Deep Learning, nearbySaussure, astroBlackness |
Technique: | espeak |
Developed by: | & Algolit |
In the process of making the Algolit datasets, careful consideration was given to the selection of the source texts. Our attempt was to have a variety of tone of voices that highlights the heterogeneity of all of them combined.
The texts were gathered from aaaaarg.fail, gen.lib.rus.ec, archive.org and gutenberg.org, processed with terminal commands such as pdftotext in order to generate .txt files and stripped of punctuation marks with the help of a Python code snippet.
The ensuing datasets are:
The data (e)speaks is an audio installation that reads out specific sentences from their text body.
Dataset: astroBlackness. Source: Octavia E. Butler. Author: Gerry Canavan. Publisher: University of Illinois Press. Year: 2016. Page: 145. Paragraph: 3.
She returned over and over again to world-building aspects of the narrative — the social organization of the colony, the physics governing Bow, the possible organisms that might live in such a place, Earth-historical parallels to their situation—but could never get the plot itself to gel.
Dataset: astroBlackness. Source: Octavia E. Butler. Author: Gerry Canavan. Publisher: University of Illinois Press. Year: 2016. Page: 186. Paragraph: 3.
More important than technique, however, is for authors to remember that they are writing about people. Authors who forget this, who do not relax and get comfortable with their racially different characters, can wind up creating unbelievable, self-consciously manipulated puppets; pieces of furniture who exist within a story but contribute nothing to it; or stereotypes guaranteed to be offensive.
Dataset: astroBlackness. Source: Octavia E. Butler. Author: Gerry Canavan. Publisher: University of Illinois Press. Year: 2016. Page: 46. Paragraph: 5.
The sole competitor to the unchallenged hegemony of the Patternists, beyond the social instability implied by their own endless internal struggle for dominance, comes in the form of the Clayarks, mutated humans sick from an interplanetary virus, who are partially immune to Patternist control and who exist in the wild spaces between Patternist enclaves.
Dataset: astroBlackness. Source: Social Text - Afrofuturist Issue. Author: Tracie Morris. Publisher: Duke University Press. Year: 2002. Page: 93. Paragraph: 1.
My first word was an error
according to the machine I spoke it in.
Whispering into an orifice used to be intimate,
Now, the Neural Network Noir twitters from every misplaced
exclamation. Deep spell check.
Dataset: astroBlackness. Source: The Immeasurable Equation. Author: Sun Ra. Publisher: Hartmut Geerken. Year: 2005. Page: 306.
· = aim
· = end
· = period
· = time
· = era
· = age
· = cycle
Dataset: nearbySaussure. Source: The Cambridge Companion to Saussure. Author: Peter Wunderli. Publisher: Cambridge University Press. Year: 2006. Page: 21. Paragraph: 3.
All these concessions open the door to coincidence and finally turn the anagram into a phenomenon of probability. In the end one is left wondering whether it is not possible to extract any word out of any text of a certain length.
Dataset: nearbySaussure. Source: Saussure, Derrida, and the Metaphysics of Subjectivity. Author: Robert M. Strozier. Publisher: Mouton de Gruyter. Year: 1988. Page: 21. Paragraph: 3.
“the linguist's task is not to study utterances for their own sake … but to provide evidence about the nature of the underlying system, the English language”
Dataset: nearbySaussure. Source: Saussure, Derrida, and the Metaphysics of Subjectivity. Author: Robert M. Strozier. Publisher: Mouton de Gruyter. Year: 1988. Page: 49. Paragraph: 2.
It is the point of view, as Saussure says, which determines the fact or entity--the object--within the mass of data. “Point of view” is a technical term for Saussure: it is another term for form; that is, the form represents what is induced upon the mass of data, now considered as a material capable of accepting the various forms.
Dataset: nearbySaussure. Source: Saussure's Philosophy of Language as Phenomenology. Author: Beata Stawarska. Publisher: Oxford University Press. Year: 2015. Page: 43. Paragraph: 1.
Temporality proper to language itself intersects the signifier and the signified in a relation of mutual interdependency; this naturally makes the ideas themselves subject to mutation and change. Importantly, the source materials do not support the structuralist notion of a fixed and immutable system; such a notion emerges rather as the philosophers’ abstraction removed from the concrete reality of language in usage;
Dataset: Learning from Deep Learning. Source: Deep Learning for Java. Author: Skymind.
The vectors we use to represent words are called neural word embeddings, and representations are strange. One thing describes another, even though those two things are radically different. As Elvis Costello said: “Writing about music is like dancing about architecture”.
Dataset: Learning from Deep Learning. Source: Deep Learning. Author: Ian Goodfellow and Yoshua Bengio and Aaron Courville. Page: 112. Paragraph: 1.
In practice, the learning algorithm does not actually find the best function, but merely one that significantly reduces the training error. These additional limitations, such as the imperfection of the optimization algorithm, mean that the learning algorithm’s effective capacity may be less than the representational capacity of the model family.
Dataset: Learning from Deep Learning. Source: Deep Learning. Author: Ian Goodfellow and Yoshua Bengio and Aaron Courville. Page: 472. Paragraph: 3.
The idea of forming an embedding for a symbol was extended to the idea of an embedding for a word by Deerwester et al. (1990). These embeddings were learned using the SVD. Later, embeddings would be learned by neural networks.
Dataset: Frankenstein. Source: Frankenstein. Author: Mary Shelley.
My education was neglected, yet I was passionately fond of reading.
Dataset: Frankenstein. Source: Frankenstein. Author: Mary Shelley.
The different accidents of life are not so changeable as the feelings of human nature.
Dataset: Frankenstein. Source: Frankenstein. Author: Mary Shelley.
Invention, it must be humbly admitted, does not consist in creating out of void, but out of chaos; the materials must, in the first place be afforded: it can give form to dark shapeless substances but cannot bring into being the substance itself.