Actions

The data (e)speaks: Difference between revisions

From Algolit

m
 
(3 intermediate revisions by 2 users not shown)
Line 12: Line 12:
 
In the process of making the Algolit datasets, careful consideration was given to the selection of the source texts. Our attempt was to have a variety of tone of voices that highlights the heterogeneity of all of them combined.
 
In the process of making the Algolit datasets, careful consideration was given to the selection of the source texts. Our attempt was to have a variety of tone of voices that highlights the heterogeneity of all of them combined.
  
The texts were gathered from aaaaarg.fail, gen.lib.rus.ec, archive.org and gutenberg.org, run through terminal commands such as [https://en.wikipedia.org/wiki/Pdftotext pdftotext] in order to generate .txt files and stripped of punctuation marks with the help of [https://gitlab.constantvzw.org/algolit/algolit/blob/master/algoliterary_encounter/algoliterary-toolkit/text-punctuation-clean-up.py a Python code snippet].
+
The texts were gathered from aaaaarg.fail, gen.lib.rus.ec, archive.org and gutenberg.org, processed with terminal commands such as [https://en.wikipedia.org/wiki/Pdftotext pdftotext] in order to generate .txt files and stripped of punctuation marks with the help of [https://gitlab.constantvzw.org/algolit/algolit/blob/master/algoliterary_encounter/algoliterary-toolkit/text-punctuation-clean-up.py a Python code snippet].
  
 
The ensuing datasets are:
 
The ensuing datasets are:
Line 20: Line 20:
 
* [[astroBlackness]]
 
* [[astroBlackness]]
  
''The data (e)speaks'' is an audio installation that gives a voice to the datasets by selecting specific sentences from the text body.
+
''The data (e)speaks'' is an audio installation that reads out specific sentences from their text body.
  
*Dataset: '''[[astroBlackness]]'''. Source: '''Octavia E. Butler'''. Author: '''Gerry Canavan'''. Publisher: '''University of Illinois Press'''. Year: '''2016'''. Page: '''145'''. Paragraph: '''3'''.  
+
<small>Dataset: '''[[astroBlackness]]'''. Source: '''Octavia E. Butler'''. Author: '''Gerry Canavan'''. Publisher: '''University of Illinois Press'''. Year: '''2016'''. Page: '''145'''. Paragraph: '''3'''.</small><br>
 
''She returned over and over again to world-building aspects of the narrative — the social organization of the colony, the physics governing Bow, the possible organisms that might live in such a place, Earth-historical parallels to their situation—but could never get the plot itself to gel.''
 
''She returned over and over again to world-building aspects of the narrative — the social organization of the colony, the physics governing Bow, the possible organisms that might live in such a place, Earth-historical parallels to their situation—but could never get the plot itself to gel.''
  
*Dataset: '''[[astroBlackness]]'''. Source: '''Octavia E. Butler'''. Author: '''Gerry Canavan'''. Publisher: '''University of Illinois Press'''. Year: '''2016'''. Page: '''151'''. Paragraph: '''3'''.
+
<small>Dataset: '''[[astroBlackness]]'''. Source: '''Octavia E. Butler'''. Author: '''Gerry Canavan'''. Publisher: '''University of Illinois Press'''. Year: '''2016'''. Page: '''186'''. Paragraph: '''3'''.</small><br>
''Bow was a place where history wasn’t the unhappy curse we were condemned to, a place where the tug-of-war between collective survival and collective insanity might play itself out in another way.''
 
 
 
*Dataset: '''[[astroBlackness]]'''. Source: '''Octavia E. Butler'''. Author: '''Gerry Canavan'''. Publisher: '''University of Illinois Press'''. Year: '''2016'''. Page: '''186'''. Paragraph: '''3'''.  
 
 
''More important than technique, however, is for authors to remember that they are writing about people. Authors who forget this, who do not relax and get comfortable with their racially different characters, can wind up creating unbelievable, self-consciously manipulated puppets; pieces of furniture who exist within a story but contribute nothing to it; or stereotypes guaranteed to be offensive.''
 
''More important than technique, however, is for authors to remember that they are writing about people. Authors who forget this, who do not relax and get comfortable with their racially different characters, can wind up creating unbelievable, self-consciously manipulated puppets; pieces of furniture who exist within a story but contribute nothing to it; or stereotypes guaranteed to be offensive.''
  
*Dataset: '''[[astroBlackness]]'''. Source: '''Octavia E. Butler'''. Author: '''Gerry Canavan'''. Publisher: '''University of Illinois Press'''. Year: '''2016'''. Page: '''46'''. Paragraph: '''5'''.  
+
<small>Dataset: '''[[astroBlackness]]'''. Source: '''Octavia E. Butler'''. Author: '''Gerry Canavan'''. Publisher: '''University of Illinois Press'''. Year: '''2016'''. Page: '''46'''. Paragraph: '''5'''.</small><br>
 
''The sole competitor to the unchallenged hegemony of the Patternists, beyond the social instability implied by their own endless internal struggle for dominance, comes in the form of the Clayarks, mutated humans sick from an interplanetary virus, who are partially immune to Patternist control and who exist in the wild spaces between Patternist enclaves.''
 
''The sole competitor to the unchallenged hegemony of the Patternists, beyond the social instability implied by their own endless internal struggle for dominance, comes in the form of the Clayarks, mutated humans sick from an interplanetary virus, who are partially immune to Patternist control and who exist in the wild spaces between Patternist enclaves.''
  
*Dataset: '''[[astroBlackness]]'''. Source: '''Social Text - Afrofuturist Issue'''. Author: '''Tracie Morris'''. Publisher: '''Duke University Press'''. Year: '''2002'''. Page: '''93'''. Paragraph: '''1'''.  
+
<small>Dataset: '''[[astroBlackness]]'''. Source: '''Social Text - Afrofuturist Issue'''. Author: '''Tracie Morris'''. Publisher: '''Duke University Press'''. Year: '''2002'''. Page: '''93'''. Paragraph: '''1'''.</small><br>
 
''My first word was an error<br>''
 
''My first word was an error<br>''
 
''according to the machine I spoke it in.<br>''
 
''according to the machine I spoke it in.<br>''
Line 41: Line 38:
 
''exclamation. Deep spell check.''
 
''exclamation. Deep spell check.''
  
*Dataset: ''[[astroBlackness]]''. Source: ''The Immeasurable Equation''. Author: ''Sun Ra''. Publisher: ''Hartmut Geerken''. Year: ''2005''. Page: ''306''.
+
<small>Dataset: '''[[astroBlackness]]'''. Source: '''The Immeasurable Equation'''. Author: '''Sun Ra'''. Publisher: '''Hartmut Geerken'''. Year: '''2005'''. Page: '''306'''.</small><br>
 
''· = aim<br/>''
 
''· = aim<br/>''
 
''· = end<br/>''
 
''· = end<br/>''
Line 50: Line 47:
 
''· = cycle<br/>''
 
''· = cycle<br/>''
  
*Dataset: '''[[nearbySaussure]]'''. Source: '''The Cambridge Companion to Saussure'''. Author: '''Peter Wunderli'''. Publisher: '''Cambridge University Press'''. Year: '''2006'''. Page: '''21'''. Paragraph: '''3'''.  
+
<small>Dataset: '''[[nearbySaussure]]'''. Source: '''The Cambridge Companion to Saussure'''. Author: '''Peter Wunderli'''. Publisher: '''Cambridge University Press'''. Year: '''2006'''. Page: '''21'''. Paragraph: '''3'''.</small><br>
 
''All these concessions open the door to coincidence and finally turn the anagram into a phenomenon of probability. In the end one is left wondering whether it is not possible to extract any word out of any text of a certain length.''
 
''All these concessions open the door to coincidence and finally turn the anagram into a phenomenon of probability. In the end one is left wondering whether it is not possible to extract any word out of any text of a certain length.''
  
*Dataset: '''[[nearbySaussure]]'''. Source: '''Saussure, Derrida, and the Metaphysics of Subjectivity'''. Author: '''Robert M. Strozier'''. Publisher: '''Mouton de Gruyter'''. Year: '''1988'''. Page: '''21'''. Paragraph: '''3''.  
+
<small>Dataset: '''[[nearbySaussure]]'''. Source: '''Saussure, Derrida, and the Metaphysics of Subjectivity'''. Author: '''Robert M. Strozier'''. Publisher: '''Mouton de Gruyter'''. Year: '''1988'''. Page: '''21'''. Paragraph: '''3'''.</small><br>
''"the linguist's task is not to study utterances for their own sake . . . but to provide evidence about the nature of the underlying system, the English language" ''
+
''“the linguist's task is not to study utterances for their own sake but to provide evidence about the nature of the underlying system, the English language”''
  
*Dataset: '''[[nearbySaussure]]'''. Source: '''Saussure, Derrida, and the Metaphysics of Subjectivity'''. Author: '''Robert M. Strozier'''. Publisher: '''Mouton de Gruyter'''. Year: '''1988'''. Page: '''49'''. Paragraph: '''2''.
+
<small>Dataset: '''[[nearbySaussure]]'''. Source: '''Saussure, Derrida, and the Metaphysics of Subjectivity'''. Author: '''Robert M. Strozier'''. Publisher: '''Mouton de Gruyter'''. Year: '''1988'''. Page: '''49'''. Paragraph: '''2'''.</small><br>
''It is the point of view, as Saussure says, which determines the fact or entity--the object--within the mass of data. "Point of view" is a technical term for Saussure: it is another term for form; that is, the form represents what is induced upon the mass of data, now considered as a material capable of accepting the various forms.''
+
''It is the point of view, as Saussure says, which determines the fact or entity--the object--within the mass of data. “Point of view” is a technical term for Saussure: it is another term for form; that is, the form represents what is induced upon the mass of data, now considered as a material capable of accepting the various forms.''
  
*Dataset: '''[[nearbySaussure]]'''. Source: '''Saussure's Philosophy of Language as Phenomenology'''. Author: '''Beata Stawarska'''. Publisher: '''Oxford University Press'''. Year: '''2015'''. Page: '''43'''. Paragraph: '''1''.
+
<small>Dataset: '''[[nearbySaussure]]'''. Source: '''Saussure's Philosophy of Language as Phenomenology'''. Author: '''Beata Stawarska'''. Publisher: '''Oxford University Press'''. Year: '''2015'''. Page: '''43'''. Paragraph: '''1'''.</small><br>
 
''Temporality proper to language itself intersects the signifier and the signified in a relation of mutual interdependency; this naturally makes the ideas themselves subject to mutation and change. Importantly, the source materials do not support the structuralist notion of a fixed and immutable system; such a notion emerges rather as the philosophers’ abstraction removed from the concrete reality of language in usage;''
 
''Temporality proper to language itself intersects the signifier and the signified in a relation of mutual interdependency; this naturally makes the ideas themselves subject to mutation and change. Importantly, the source materials do not support the structuralist notion of a fixed and immutable system; such a notion emerges rather as the philosophers’ abstraction removed from the concrete reality of language in usage;''
  
*Dataset: '''[[Learning from Deep Learning]]'''. Source: '''[https://deeplearning4j.org/word2vec#embed Deep Learning for Java]'''. Author: '''Skymind'''.
+
<small>Dataset: '''[[Learning from Deep Learning]]'''. Source: '''[https://deeplearning4j.org/word2vec#embed Deep Learning for Java]'''. Author: '''Skymind'''.</small><br>
''The vectors we use to represent words are called neural word embeddings, and representations are strange. One thing describes another, even though those two things are radically different. As Elvis Costello said: “Writing about music is like dancing about architecture.''
+
''The vectors we use to represent words are called neural word embeddings, and representations are strange. One thing describes another, even though those two things are radically different. As Elvis Costello said: “Writing about music is like dancing about architecture”.''
  
*Dataset: '''[[Learning from Deep Learning]]'''. Source: '''Deep Learning'''. Author: '''Ian Goodfellow and Yoshua Bengio and Aaron Courville'''. Page: '''112'''. Paragraph: '''1'''.
+
<small>Dataset: '''[[Learning from Deep Learning]]'''. Source: '''Deep Learning'''. Author: '''Ian Goodfellow and Yoshua Bengio and Aaron Courville'''. Page: '''112'''. Paragraph: '''1'''.</small><br>
'' In practice, thelearning algorithm does not actually find the best function, but merely one thatsignificantly reduces the training error. These additional limitations, such as theimperfection of the optimization algorithm, mean that the learning algorithm’seffective capacitymay be less than the representational capacity of the modelfamily.''
+
''In practice, the learning algorithm does not actually find the best function, but merely one that significantly reduces the training error. These additional limitations, such as the imperfection of the optimization algorithm, mean that the learning algorithm’s effective capacity may be less than the representational capacity of the model family.''
  
*Dataset: '''[[Learning from Deep Learning]]'''. Source: '''Deep Learning'''. Author: '''Ian Goodfellow and Yoshua Bengio and Aaron Courville'''. Page: '''472'''. Paragraph: '''3'''.
+
<small>Dataset: '''[[Learning from Deep Learning]]'''. Source: '''Deep Learning'''. Author: '''Ian Goodfellow and Yoshua Bengio and Aaron Courville'''. Page: '''472'''. Paragraph: '''3'''.</small><br>
''The idea of forming an embedding for a symbol was extended to the idea of anembedding for a word by Deerwester et al. (1990). These embeddings were learnedusing the SVD. Later, embeddings would be learned by neural networks.''
+
''The idea of forming an embedding for a symbol was extended to the idea of an embedding for a word by Deerwester et al. (1990). These embeddings were learned using the SVD. Later, embeddings would be learned by neural networks.''
  
*Dataset: '''[[Frankenstein]]'''. Source: '''Frankenstein'''. Author: '''Mary Shelley'''.  
+
<small>Dataset: '''[[Frankenstein]]'''. Source: '''Frankenstein'''. Author: '''Mary Shelley'''.</small><br>
 
''My education was neglected, yet I was passionately fond of reading.''
 
''My education was neglected, yet I was passionately fond of reading.''
  
*Dataset: '''[[Frankenstein]]'''. Source: '''Frankenstein'''. Author: '''Mary Shelley'''.  
+
<small>Dataset: '''[[Frankenstein]]'''. Source: '''Frankenstein'''. Author: '''Mary Shelley'''.</small><br>
 
''The different accidents of life are not so changeable as the feelings of human nature.''
 
''The different accidents of life are not so changeable as the feelings of human nature.''
  
*Dataset: '''[[Frankenstein]]'''. Source: '''Frankenstein'''. Author: '''Mary Shelley'''.  
+
<small>Dataset: '''[[Frankenstein]]'''. Source: '''Frankenstein'''. Author: '''Mary Shelley'''.</small><br>
 
''Invention, it must be humbly admitted, does not consist in creating out of void, but out of chaos; the materials must, in the first place be afforded: it can give form to dark shapeless substances but cannot bring into being the substance itself.''
 
''Invention, it must be humbly admitted, does not consist in creating out of void, but out of chaos; the materials must, in the first place be afforded: it can give form to dark shapeless substances but cannot bring into being the substance itself.''
  
  
 
[[Category:Algoliterary-Encounters]]
 
[[Category:Algoliterary-Encounters]]

Latest revision as of 19:06, 1 November 2017

Type: Algoliterary exploration
Datasets: Frankenstein, Learning from Deep Learning, nearbySaussure, astroBlackness
Technique: espeak
Developed by: & Algolit

In the process of making the Algolit datasets, careful consideration was given to the selection of the source texts. Our attempt was to have a variety of tone of voices that highlights the heterogeneity of all of them combined.

The texts were gathered from aaaaarg.fail, gen.lib.rus.ec, archive.org and gutenberg.org, processed with terminal commands such as pdftotext in order to generate .txt files and stripped of punctuation marks with the help of a Python code snippet.

The ensuing datasets are:

The data (e)speaks is an audio installation that reads out specific sentences from their text body.

Dataset: astroBlackness. Source: Octavia E. Butler. Author: Gerry Canavan. Publisher: University of Illinois Press. Year: 2016. Page: 145. Paragraph: 3.
She returned over and over again to world-building aspects of the narrative — the social organization of the colony, the physics governing Bow, the possible organisms that might live in such a place, Earth-historical parallels to their situation—but could never get the plot itself to gel.

Dataset: astroBlackness. Source: Octavia E. Butler. Author: Gerry Canavan. Publisher: University of Illinois Press. Year: 2016. Page: 186. Paragraph: 3.
More important than technique, however, is for authors to remember that they are writing about people. Authors who forget this, who do not relax and get comfortable with their racially different characters, can wind up creating unbelievable, self-consciously manipulated puppets; pieces of furniture who exist within a story but contribute nothing to it; or stereotypes guaranteed to be offensive.

Dataset: astroBlackness. Source: Octavia E. Butler. Author: Gerry Canavan. Publisher: University of Illinois Press. Year: 2016. Page: 46. Paragraph: 5.
The sole competitor to the unchallenged hegemony of the Patternists, beyond the social instability implied by their own endless internal struggle for dominance, comes in the form of the Clayarks, mutated humans sick from an interplanetary virus, who are partially immune to Patternist control and who exist in the wild spaces between Patternist enclaves.

Dataset: astroBlackness. Source: Social Text - Afrofuturist Issue. Author: Tracie Morris. Publisher: Duke University Press. Year: 2002. Page: 93. Paragraph: 1.
My first word was an error
according to the machine I spoke it in.
Whispering into an orifice used to be intimate,
Now, the Neural Network Noir twitters from every misplaced
exclamation. Deep spell check.

Dataset: astroBlackness. Source: The Immeasurable Equation. Author: Sun Ra. Publisher: Hartmut Geerken. Year: 2005. Page: 306.
· = aim
· = end
· = period
· = time
· = era
· = age
· = cycle

Dataset: nearbySaussure. Source: The Cambridge Companion to Saussure. Author: Peter Wunderli. Publisher: Cambridge University Press. Year: 2006. Page: 21. Paragraph: 3.
All these concessions open the door to coincidence and finally turn the anagram into a phenomenon of probability. In the end one is left wondering whether it is not possible to extract any word out of any text of a certain length.

Dataset: nearbySaussure. Source: Saussure, Derrida, and the Metaphysics of Subjectivity. Author: Robert M. Strozier. Publisher: Mouton de Gruyter. Year: 1988. Page: 21. Paragraph: 3.
“the linguist's task is not to study utterances for their own sake … but to provide evidence about the nature of the underlying system, the English language”

Dataset: nearbySaussure. Source: Saussure, Derrida, and the Metaphysics of Subjectivity. Author: Robert M. Strozier. Publisher: Mouton de Gruyter. Year: 1988. Page: 49. Paragraph: 2.
It is the point of view, as Saussure says, which determines the fact or entity--the object--within the mass of data. “Point of view” is a technical term for Saussure: it is another term for form; that is, the form represents what is induced upon the mass of data, now considered as a material capable of accepting the various forms.

Dataset: nearbySaussure. Source: Saussure's Philosophy of Language as Phenomenology. Author: Beata Stawarska. Publisher: Oxford University Press. Year: 2015. Page: 43. Paragraph: 1.
Temporality proper to language itself intersects the signifier and the signified in a relation of mutual interdependency; this naturally makes the ideas themselves subject to mutation and change. Importantly, the source materials do not support the structuralist notion of a fixed and immutable system; such a notion emerges rather as the philosophers’ abstraction removed from the concrete reality of language in usage;

Dataset: Learning from Deep Learning. Source: Deep Learning for Java. Author: Skymind.
The vectors we use to represent words are called neural word embeddings, and representations are strange. One thing describes another, even though those two things are radically different. As Elvis Costello said: “Writing about music is like dancing about architecture”.

Dataset: Learning from Deep Learning. Source: Deep Learning. Author: Ian Goodfellow and Yoshua Bengio and Aaron Courville. Page: 112. Paragraph: 1.
In practice, the learning algorithm does not actually find the best function, but merely one that significantly reduces the training error. These additional limitations, such as the imperfection of the optimization algorithm, mean that the learning algorithm’s effective capacity may be less than the representational capacity of the model family.

Dataset: Learning from Deep Learning. Source: Deep Learning. Author: Ian Goodfellow and Yoshua Bengio and Aaron Courville. Page: 472. Paragraph: 3.
The idea of forming an embedding for a symbol was extended to the idea of an embedding for a word by Deerwester et al. (1990). These embeddings were learned using the SVD. Later, embeddings would be learned by neural networks.

Dataset: Frankenstein. Source: Frankenstein. Author: Mary Shelley.
My education was neglected, yet I was passionately fond of reading.

Dataset: Frankenstein. Source: Frankenstein. Author: Mary Shelley.
The different accidents of life are not so changeable as the feelings of human nature.

Dataset: Frankenstein. Source: Frankenstein. Author: Mary Shelley.
Invention, it must be humbly admitted, does not consist in creating out of void, but out of chaos; the materials must, in the first place be afforded: it can give form to dark shapeless substances but cannot bring into being the substance itself.