OK, here’s a more coherent entry for my Digital Humanities Encyclopedia project.
The term big data is actually descriptive of the underlying phenomenon to which it refers. Big data refers to the streams of bits, numbers, texts, images, interactions, stimuli, or information that is too massive in quantity to process using single computers or even small clusters of machines. In the field of digital humanities, practitioners fuse the use of data (big and not so big) and technical tools to ask and answer humanist questions. An example to illustrate this is the project written up in Quantitative Analysis of Culture Using Millions of Digitized Books. In this project, the authors analyzed a subset of the Google Books corpus to shed insight into issues like cultural forgetfulness, censorship, and changes in language use.
Have a read and check out additional resources listed below. Remember that if there’s anything you don’t like, you are empowered to change it!
- Academic (but engaging) articles on the phenomenon:
Bell, G., Hey, T. & Szalay, A. (2009). Beyond the data deluge. Science, 323: 1297-1298. http://www.sciencemag.org/content/323/5919/1297.full
Crane, G. R. (2006). What do you do with a million books? D-Lib Magazine, 12(3). http://www.dlib.org/dlib/march06/crane/03crane.html
Frederick W. Gibbs and Trevor J. Owens. The Hermeneutics of Data and Historical Writing. http://writinghistory.trincoll.edu/data/gibbs-owens-2012-spring
- Image (linked to a Smithsonian article): “big data is getting bigger at at a stunning rate”
- Blog post: from a digital humanities classmate, Roderic Crooks, Big Data, Polls, and History
- Tags: when talking about big data, people may use any of the following terms, big data, research data, data deluge, big science, fourth paradigm
- Twitter hashtag: #bigdata is the suggested hashtag to use when Tweeting about big data. It’s already being used a aplenty in Twitter-land.