{"id":1768,"date":"2011-07-16T11:00:41","date_gmt":"2011-07-16T10:00:41","guid":{"rendered":"http:\/\/jodischneider.com\/blog\/?p=1768"},"modified":"2011-07-16T11:00:41","modified_gmt":"2011-07-16T10:00:41","slug":"qotd-move-the-computation-to-the-data-the-future-of-nonconsumptive-research-with-google-books","status":"publish","type":"post","link":"https:\/\/jodischneider.com\/blog\/2011\/07\/16\/qotd-move-the-computation-to-the-data-the-future-of-nonconsumptive-research-with-google-books\/","title":{"rendered":"QOTD: &#8220;move the computation to the data&#8221;: the future of nonconsumptive research with Google Books"},"content":{"rendered":"<p>Douglas Knox <a title=\"Digital Humanities 2011 and the elephant in the tent\" href=\"http:\/\/beingnumero.us\/blog\/2011\/07\/digital-humanities-2011-and-the-elephant-in-the-tent\/\">touches on<\/a> the future of &#8220;distant reading&#8221; ((What&#8217;s &#8220;distant reading&#8221;? Think &#8220;text mining of literature&#8221;&#8211;but it&#8217;s deeper than that. It&#8217;s also <a href=\"http:\/\/www.stanford.edu\/~mjockers\/cgi-bin\/drupal\/node\/59\">called the macroeconomics of literature (&#8220;macroanalysis&#8221;)<\/a> and <a title=\"What is distant reading?\" href=\"http:\/\/www.nytimes.com\/2011\/06\/26\/books\/review\/the-mechanic-muse-what-is-distant-reading.html?_r=1\">scoffed at<\/a> by the NYTimes, who don&#8217;t get the deeper underlying purpose.)) with Google Books. 
((By the way, what approach is the <a title=\"Hathi Trust\" href=\"http:\/\/www.hathitrust.org\/\">Hathi Trust<\/a> taking?))<\/p>\n<blockquote><p>For rights management reasons and also for material engineering reasons, the research architecture will\u00a0<em>move the computation to the data.<\/em> That is, the vision of the future here is not one in which major data providers give access to data in big downloadable chunks for reuse and querying in other contexts, but one in which researchers\u2019 queries are somehow formalized in code that the data provider\u2019s servers will run on the researcher\u2019s behalf, presumably also producing\u00a0<a href=\"https:\/\/secure.wikimedia.org\/wikipedia\/en\/wiki\/42_%28number%29\">economically sized<\/a> result sets.<\/p><\/blockquote>\n<p>There are also some implicit research goals for people working in cyberinfrastructure, in digital humanities support, and in text mining that aims to support humanities scholars:<\/p>\n<blockquote><p>Whatever we mean by \u201ccomputation,\u201d that is, can\u2019t be locked up in an interface that tightly binds computation and data. 
Readers already need (and for the most part do not have) our own agents and our own data, our own algorithms for testing, validating, calibrating, and recording our interaction with the black boxes of external infrastructure.<\/p><\/blockquote>\n<p>This kind of black-box infrastructure contrasts with &#8220;using technology critically and experimentally, fiddling with knobs to see what happens, and adjusting based on what they find&#8221; when a scholar is &#8220;free to write short scripts and see results in quick cycles of exploration&#8221;.<\/p>\n<p>I&#8217;m pulling these out of context, from <a title=\"Digital Humanities 2011 and the elephant in the tent\" href=\"http:\/\/beingnumero.us\/blog\/2011\/07\/digital-humanities-2011-and-the-elephant-in-the-tent\/\">Douglas&#8217; post<\/a> on the <a title=\"Digital Humanities 2011\" href=\"https:\/\/dh2011.stanford.edu\/\">Digital Humanities 2011 conference<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Douglas Knox touches on the future of &#8220;distant reading&#8221; ((What&#8217;s &#8220;distant reading&#8221;? Think &#8220;text mining of literature&#8221;&#8211;but it&#8217;s deeper than that. 
It&#8217;s also called the macroeconomics of literature (&#8220;macroanalysis&#8221;) and<\/p>\n","protected":false},"author":3,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[5,85],"tags":[395,394,396,166,397],"class_list":["post-1768","post","type-post","status-publish","format-standard","hentry","category-books-and-reading","category-information-ecosystem","tag-dh11","tag-digital-humanities","tag-distant-reading","tag-google-books","tag-macroanalysis"],"_links":{"self":[{"href":"https:\/\/jodischneider.com\/blog\/wp-json\/wp\/v2\/posts\/1768","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/jodischneider.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/jodischneider.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/jodischneider.com\/blog\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/jodischneider.com\/blog\/wp-json\/wp\/v2\/comments?post=1768"}],"version-history":[{"count":7,"href":"https:\/\/jodischneider.com\/blog\/wp-json\/wp\/v2\/posts\/1768\/revisions"}],"predecessor-version":[{"id":1775,"href":"https:\/\/jodischneider.com\/blog\/wp-json\/wp\/v2\/posts\/1768\/revisions\/1775"}],"wp:attachment":[{"href":"https:\/\/jodischneider.com\/blog\/wp-json\/wp\/v2\/media?parent=1768"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/jodischneider.com\/blog\/wp-json\/wp\/v2\/categories?post=1768"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/jodischneider.com\/blog\/wp-json\/wp\/v2\/tags?post=1768"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}