Archive for the ‘books and reading’ Category

Annotating PDFs on an iPad: GoodReader and iAnnotatePDF

July 31st, 2011

Colleagues were interested in my recommendations for iPad annotation: GoodReader and iAnnotatePDF. Here’s a brief comparison.

Both save Acrobat-compatible annotations, which can be exported as text (for instance, to see everything you’ve highlighted in yellow). Both also offer syncing and multiple styles of annotation. The exact annotation workflow and navigation differ somewhat.

GoodReader’s main strength is the ability to easily pinpoint the exact boundaries of an annotation: a circular magnifying ‘loupe’ window automatically pops up. GoodReader also warns you when scanned images don’t have text behind them (offering to OCR them would be a welcome, though challenging, enhancement; it would be enough to put them into an OCR queue that you could have Acrobat Pro watch and act on). One weakness (for me at least) is that to get the tool menu, you must tap in the middle of the screen. My fingers seem to expect it to pop up when I tap on the right-hand side of the screen: sometimes that advances the page, but sometimes it just changes the view on the current page. Further, I find its small black-and-white icons somewhat confusing.

I prefer iAnnotatePDF, especially because it saves annotations by default, has customizable navigation, and has clearer icons. Its key strength is that annotations are auto-saved, with ‘undo’, ‘delete’, and ‘edit’ functions. Further, the annotation type is maintained between annotations, until you (say) put down the highlighter by clicking an x. This persistence is also a small weakness, since I find that to switch pages I have to close the annotation tool I’m currently using. Another weakness is that there’s a limited time window for editing existing annotations: just after they are created, annotations can be adjusted, for instance to move the boundaries of text highlights and underlines. Once this period has expired, annotations can be deleted, but their locations cannot be adjusted (as far as I can tell). A third weakness is that interacting with image-only PDFs can be confusing: without any text, some functions (text highlight, text underline, …) just don’t work, and there is no warning or notice.

I would be interested in hearing comparisons of the syncing functionality, as well as comparisons to PDFExpert.

Criterion | GoodReader | iAnnotatePDF
Pageview | default is snap to page (double-spreads show left-to-right) | flow (can see parts of 2 pages at once, top-to-bottom)
Saving annotations | must save each annotation | annotations save automatically
Navigation | tap left/right to navigate forward/back; scroll only shows the same page | tap, slide, or swipe to navigate (customizable)
Toolbar | tap in the middle | tap on the right
Icons | black & white, some are obscure | medium-sized color, some are clearly understandable

Posted in books and reading, iOS: iPad, iPhone, etc. | Comments (1)

How do you organize papers on your iPad?

July 31st, 2011

You read papers, right? How do you store and organize them? I’m looking for advice on a workflow for annotating PDFs and syncing between devices.

I’m striking out on iPad apps for organizing scholarly papers. Papers2 doesn’t pull annotated copies back. Mendeley lite doesn’t even let me log in ((In Mendeley v1.3.1 (build 19) when I enter my login details, the only option is ‘close’. After closing, Mendeley reports “Not logged in”. Yes, I’ve double-checked my password!)). Zotero, which has been my main reference manager for at least 5 years, doesn’t offer an iPad app.

For annotation, I like iAnnotatePDF and GoodReader (and I’m getting ready to try PDFExpert). What I don’t know is how to have manageable filenames, when the documents originate in another iPad app, instead of on the desktop.

The only ideas I have left involve either spending more time with file managers or relying on the syncing inside the annotation tools.

Reference managers/PDF managers:

  1. Try Sente
  2. See what DevonThink Pro Office can do, maybe with Zotero export others have worked on. Surely that’s overkill?

Syncing from annotation tools:

iAnnotate or GoodReader can “watch” folders. Main challenge is going to be coming up with a sufficiently small collection of PDFs to sync back and forth to the iPad.

  1. Stick with Zotero, maybe with files renamed from Zotfile, then use iAnnotatePDF’s “watch folder” feature to keep in sync.
  2. Stick with Papers2, manually manage the file sync for everything I’ve annotated, then watch its data directory with iAnnotatePDF as above.
  3. Try Mendeley, watch its data directory with iAnnotatePDF as above.

Thoughts and suggestions? What would you do?

Posted in books and reading, iOS: iPad, iPhone, etc. | Comments (1)

Papers2 does not integrate with external iPad applications in the way I expected

July 31st, 2011

Papers2 does not integrate with external iPad applications in the way I expected. I use iPad applications like GoodReader, iAnnotatePDF, and PDFExpert to read and annotate papers.

The functionality I expected was:

  • Export from Papers to an external PDF annotation application
  • When I reopen Papers, the annotated PDF is shown in my library

However, here is what happens:

  • Export from Papers to an external PDF annotation application. It renames the file, using a random string as the filename.
  • When I reopen Papers, only the original (unannotated) PDF is in my library.
  • Alternately when I export from the external application, the annotated file is imported as a *new* PDF, unconnected to the original, with a random string used for the filename.

I started using Papers because managing filenames in iAnnotate wasn’t working: I couldn’t figure out which files were which. So this is absolutely key for me.

==

This is a bug report to Papers2, copied here since bug reports are private. Any workarounds or suggestions for alternate annotation/reference management workflows would be very welcome.

This annotation environment completely failed to meet my expectations: I expected to ‘Open In’ an annotation application; in fact there’s just ‘Export’ and ‘Import’, meaning that the annotated file isn’t automatically stored in the Papers2 library.

Posted in books and reading, iOS: iPad, iPhone, etc. | Comments (1)

Reading Ontologically?

July 24th, 2011

What are the right ontologies for reading? And what kind of ontology support would let books recombine themselves, on the fly, in novel ways?

Today keyword searches within books and book collections are commonplace, highlighting a word in your ebook reader can bring up a definition, and dictionaries grab recent examples of word use from microblogs. ((In 2003, Gregory Crane wrote that “Already the books in a digital library are beginning to read one another and to confer among themselves before creating a new synthetic document for review by their human readers.” When I first read it in 2006, that article seemed incredibly visionary to me. Yet these commonplace “syntheses” no longer seem extraordinary to me.)) But can’t we do more? What kind of synthesis do we need (and what is possible) for supporting readers of literature, classics, and humanities texts?

Current approaches seem to aim at analysis (e.g. getting an overview of the literary works of a period with “distant reading”/“macroanalysis”) and at creating flexible critical editions (e.g. structural, sometimes overlapping markup, as in TEI-based editions and projects like Wendell Piez’ Sonneteer ((currently offline, but brilliant; do check back, meanwhile see also his Digital Humanities 2010 talk notes)).) I would call these “sensemaking” approaches rather than tools for reading.

I was intrigued by the Bible Ontology ((It’s a bit disingenuous to advertise their work as an ontology: in fact they have applied the ontology, rather than just creating it.)) because of their tagline: “ever wanted to read and study the Bible Ontologically?” Yet I don’t really know what they mean by reading ontologically ((even though I’ve given a talk about supporting reading with ontologies!)).

Of course, they have recorded various pieces of data. For instance, for Rebekah, we see her children, siblings, birthplace, book and chapters she figures in, etc.: http://bibleontology.com/page/Rebekah. ((The most meaningful of their terms is the bop:isRelatedInEvent, perhaps since these events, like Isaac_blesses_Jacob, would require more analysis to discern.))

Rebekah, from bibleontology.com

They offer a SPARQL endpoint, so you can query. For instance, to find all the married women ((Gender is not recorded so we can’t (yet) ask for all the women overall, though I’ve just asked about this.)) (live query result):

PREFIX bop: <http://bibleontology.com/property/>
select ?s ?o where {?s bop:isWifeOf ?o }
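
For readers who would rather script this than use the web form, here is a minimal Python sketch of how such a query could be sent and its results read. It relies only on the standard SPARQL protocol (a GET request with a `query` parameter) and the standard SPARQL JSON results layout. The endpoint address below is a placeholder, since the real one isn't given here, and the function names are made up for illustration.

```python
from urllib.parse import urlencode

# Placeholder only: substitute the site's actual SPARQL endpoint address.
ENDPOINT = "http://example.org/sparql"

# The same query as above, asking for every (wife, husband) pair.
QUERY = """
PREFIX bop: <http://bibleontology.com/property/>
SELECT ?s ?o WHERE { ?s bop:isWifeOf ?o }
"""


def build_query_url(endpoint, query):
    """Build a GET URL that carries the query in the 'query' parameter."""
    return endpoint + "?" + urlencode({"query": query.strip()})


def wives_from_results(results):
    """Pull the ?s values out of a SPARQL JSON results document."""
    return [row["s"]["value"] for row in results["results"]["bindings"]]
```

To actually run it, one would fetch `build_query_url(ENDPOINT, QUERY)` while asking for `application/sparql-results+json`, parse the response with `json.loads`, and pass the resulting dictionary to `wives_from_results`. Whether this particular endpoint returns JSON is an assumption I have not verified.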

Intense and long-term work has gone into Bible concordances, scholarship, etc., so it seems like a great use case for “reading ontologically”. With theologians and others looking at the site, using the SPARQL endpoint, etc., perhaps someone will be able to tell me what that means!

Posted in books and reading, future of publishing, semantic web | Comments (0)

QOTD: “move the computation to the data”: the future of nonconsumptive research with Google Books

July 16th, 2011

Douglas Knox touches on the future of “distant reading” ((What’s “distant reading”? Think “text mining of literature”–but it’s deeper than that. It’s also called the macroeconomics of literature (“macroanalysis”) and Hathi Trust taking?)):

For rights management reasons and also for material engineering reasons, the research architecture will move the computation to the data. That is, the vision of the future here is not one in which major data providers give access to data in big downloadable chunks for reuse and querying in other contexts, but one in which researchers’ queries are somehow formalized in code that the data provider’s servers will run on the researcher’s behalf, presumably also producing economically sized result sets.

There are also some implicit research goals, for those in cyberinfrastructure, digital humanities support, and people in text mining aiming at supporting humanities scholars:

Whatever we mean by “computation,” that is, can’t be locked up in an interface that tightly binds computation and data. Readers already need (and for the most part do not have) our own agents and our own data, our own algorithms for testing, validating, calibrating, and recording our interaction with the black boxes of external infrastructure.

This kind of black-box infrastructure contrasts with “using technology critically and experimentally, fiddling with knobs to see what happens, and adjusting based on what they find”, which is possible when a scholar is “free to write short scripts and see results in quick cycles of exploration”.

I’m pulling these out of context — from Douglas’ post on the Digital Humanities 2011 conference.

Posted in books and reading, information ecosystem | Comments (0)

QOTD: Stop crippling ebooks: invent new business models instead

May 16th, 2011

Holding on to old business models is not the way to endear yourself to customers.

But unfortunately this is also, simultaneously, a bad time to be a reader. Because the dinosaurs still don’t get it. Ten years of object lessons from the music industry, and they still don’t get it. We have learned, painfully, that media consumers—be they listeners, watchers, or readers—want one of two things:

  • DRM-free works for a reasonable price
  • or, unlimited single-payment subscription to streaming/DRMed works

Give them either of those things, and they’ll happily pay. Look at iTunes. Look at Netflix. But give them neither, and they’ll pirate. So what are publishers doing?

  • Refusing to sell DRM-free books. My debut novel will be re-e-published by the Friday Project imprint of HarperCollins UK later this year; both its editor and I would like it to be published without DRM; and yet I doubt we will be able to make that happen.
  • crippling library e-books
  • and not offering anything even remotely like a subscription service.

– Jon Evans, When Dinosaurs Ruled the Books, via James Bridle’s Stop Press

Eric Hellman is one of the pioneers of tomorrow’s ebook business models: his company, Gluejar, uses a crowdfunding model to re-release books under Creative Commons licenses. Authors and publishers are paid; fans pay for the books they’re most interested in; and everyone can read and distribute the resulting “unglued” ebooks. Everybody wins.

Posted in books and reading, future of publishing, information ecosystem | Comments (0)

Reading styles

March 2nd, 2011

To support reading, think about diversity of reading styles.

A study of “How examiners assess research theses” mentions the diversity:

[F]our examples give a good indication of the range of ‘reading styles’:

  • A (Hum/Male/17) sets aside time to read the thesis. He checks who is in the references to see that the writers are there who should be there. Then he reads slowly, from the beginning like a book, but taking copious notes.
  • B (Sc/Male/22) reads the thesis from cover to cover first without doing anything else. For the first read he is just trying to gain a general impression of what the thesis is about and whether it is a good thesis—that is, are the results worthwhile. He can also tell how much work has actually been done. After the first read he then ‘sits on it’ for a while. During the second reading he starts making notes and reading more critically. If it is an area with which he is not very familiar, he might read some of the references. He marks typographical errors, mistakes in calculations, etc., and makes a list of them. He also checks several of the references just to be sure they have been used appropriately.
  • C (SocSc/Female/27) reads the abstract first and then the introduction and the conclusion, as well as the table of contents to see how the thesis is structured; and she familiarises herself with appendices so that she knows where everything is. Then she starts reading through; generally the literature review, and methodology, in the first weekend, and the findings, analysis and conclusions in the second weekend. The intervening week allows time for ideas to mull over in her mind. On the third weekend she writes the report.
  • D (SocSc/Male/15) reads the thesis from cover to cover without marking it. He then schedules time to mark it, in about three sittings, again working from beginning to end. At this stage he ‘takes it apart’. Then he reads the whole thesis again.

from Mullins, G. & Kiley, M. (2002), It’s a PhD, not a Nobel Prize: how experienced examiners assess research theses, Studies in Higher Education, 27, 4, pp.369-386. DOI:10.1080/0307507022000011507

Parenthetical comments are (discipline/gender/interview number). Thanks to the NUIG Postgrad Research Society for suggesting this paper.

Posted in books and reading, higher education, PhD diary, scholarly communication | Comments (0)

Apple seizes control of iOS purchase chain: enforces 30% cut for Apple by prohibiting sales-oriented links from apps to the Web

February 16th, 2011

Apple’s press release about its “new subscription services” seems at first innocuous, and the well-crafted quote ((

“Our philosophy is simple—when Apple brings a new subscriber to the app, Apple earns a 30 percent share; when the publisher brings an existing or new subscriber to the app, the publisher keeps 100 percent and Apple earns nothing,” said Steve Jobs, Apple’s CEO. “All we require is that, if a publisher is making a subscription offer outside of the app, the same (or better) offer be made inside the app, so that customers can easily subscribe with one-click right in the app. We believe that this innovative subscription service will provide publishers with a brand new opportunity to expand digital access to their content onto the iPad, iPod touch and iPhone, delighting both new and existing subscribers.”

– Steve Jobs at “Apple Launches Subscriptions on the App Store”)) from Steve Jobs has been widely reposted:
“when Apple brings a new subscriber to the app, Apple earns a 30 percent share; when the publisher brings an existing or new subscriber to the app, the publisher keeps 100 percent and Apple earns nothing.” Yet analysts reading between the lines have been less than pleased.

Bad for publishers

The problems for publishers? (See also “Steve Jobs to pubs: Our way or highway”)

  • Apple takes a 30% cut of all in-app purchases ((Booksellers call this “the agency model”.))
  • Apps may not bypass in-app purchase: apps may not link to an external website (such as Amazon) ((Apple has confirmed that Kindle’s “Shop in Kindle Store” must be removed.)) that allows customers to buy content or subscriptions.
  • Content available for purchase in the app cannot be cheaper elsewhere.
  • The customer’s demographic information resides with Apple, not with the publisher. Customers must opt in to share their name, email, and ZIP code with the publisher, though Apple will of course have this information.
  • Limited reaction time; changes will be finalized by June 30th.

Bad for customers?

And there are problems for customers, too.

  • Reduction of content available in apps (likely for the near-term).
  • More complex, clunky purchase workflows (possible).
    Publishers may sell material only outside of apps, from their own website, to avoid paying 30% to Apple. Will we see a proliferation of publisher-run stores?
  • Price increases to cover Apple’s commission (likely).
    If enacted, these must apply to all customers, not just iOS device users.
  • Increased lockdown of content in the future (probably).
    Apple already prevents some iBooks customers from reading books they bought and paid for, using extra DRM affecting some jailbroken devices. This is so even though jailbreaking is explicitly legal in the United States, and even though carrier unlock and SIM-free phones are not available in the U.S.
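
To make the likely price increases concrete, here is a rough back-of-the-envelope sketch in Python. It assumes, purely for illustration, that every sale goes through in-app purchase and that the publisher wants to keep the same net revenue per sale as before. Real pricing will be messier, since (per the Jobs quote) subscribers the publisher brings in itself are not subject to the cut. The function names are my own.

```python
APPLE_SHARE = 0.30  # Apple's cut of in-app purchases, per the announcement


def publisher_net(price, share=APPLE_SHARE):
    """What the publisher keeps from one in-app sale at the given price."""
    return price * (1 - share)


def price_to_keep_net(old_price, share=APPLE_SHARE):
    """Price needed so the publisher still nets old_price after the cut."""
    return old_price / (1 - share)


def required_increase(share=APPLE_SHARE):
    """Fractional price rise needed to offset the cut: 1 / (1 - share) - 1."""
    return 1 / (1 - share) - 1
```

Under these assumptions, a subscription priced at 10.00 nets the publisher 7.00 in-app. To net the full 10.00, the price would have to rise to about 14.29, an increase of roughly 43% rather than 30%. Because content cannot be cheaper elsewhere, that higher price would apply outside the app too.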

More HTML5 apps?

The upside? Device-independent HTML5 apps may see wider adoption. HTML5 mobile apps work well on iOS, on other mobile platforms, and on laptops and desktops.

For ebooks, HTML5 means Ibis Reader and Book.ish. For publishers looking to break free of Apple, yet satisfy customers, Ibis Reader may be a particularly good choice: this year they are focusing on licensing Ibis Reader, as Liza Daly’s Threepress announced in a savvy and well-timed post, anticipating Apple’s announcement. Having been a beta tester of Ibis Reader, I can recommend it!

If you know of other HTML5 ebook apps, please leave them in the comments.

Posted in books and reading, future of publishing, information ecosystem, iOS: iPad, iPhone, etc. | Comments (0)

Supporting Reading

January 21st, 2011

Yesterday I spoke at Beyond the PDF about use cases for reading. Slides are below; the presentation was also webcast, and the video is now on YouTube (part of the Beyond the PDF video playlist) and below.

Thanks to the DERI Social Software Unit for feedback on an earlier version of this presentation. I’m particularly grateful to Allen Renear and Carole Palmer from UIUC, whose call for ontology-aware reading tools pushed me down this path, and to Geoffrey Bilder, who presented these ideas in a way I couldn’t help thinking about and remixing. Cathy Marshall’s clear exposition in Reading and Writing the Electronic Book was fundamental to digging deeper.

Posted in books and reading, future of publishing, library and information science, scholarly communication, social semantic web | Comments (2)

Wanted: the ultimate mobile app for scholarly ereading

January 7th, 2011

Nicole Hennig suggests that academic libraries and scholarly presses work together to create the ultimate mobile app for scholarly ereading. I think about it a bit differently, in terms of functional requirements.

The main functions are obtaining materials, reading them, organizing them, keeping them, and sharing them.

For obtaining materials, the key new requirement is to simplify authentication: handle campus authentication systems and personal subscriptions. Multiple credentialed identities should be supported. A secondary consideration is that RSS feeds (e.g. for journal tables of contents) should be supported.

For reading materials, the key requirement is to support multiple formats in the same application. I don’t know of a web app or mobile app that supports PDF, EPUB, and HTML. Reading interfaces matter: look to Stanza and Ibis Reader for best-in-class examples.

For organizing materials, the key is synergy between the user’s data and existing data. Allow tags, folders, and multiple collections. But also leverage existing publisher and library metadata. Keep it flexible, allowing the user to modify metadata for personal use (e.g. for consistency or personal terminology) and to optionally submit corrections.

For keeping materials, import, export, and sync content from the user’s chosen cloud-based storage and WebDAV servers. No other device (e.g. laptop or desktop) should be needed.

For sharing materials, support lightweight micropublishing on social networks and email; networks should be extensible and user-customizable. Sync to or integrate with citation managers and social cataloging/reading list management systems.

Regardless of the ultimate system, I’d stress that device independence is important, meaning that an HTML5 website would probably be the place to start: look to Ibis Reader as a model.

Posted in books and reading, future of publishing, information ecosystem, library and information science, scholarly communication | Comments (5)