AP Scans Sarah Palin Book Without Permission; Look Out Google Book Search

Google, accused by some as being a book thief, now has company — the Associated Press. The AP patted itself on the back in an internal memo that detailed how it scanned a copy of Sarah Palin’s book without permission, to make it searchable.

The irony is rich. The AP hasn’t taken a stance against Google Book Search that I know of, on whether it considers the project to violate fair use or not. But the AP has been pretty clear that it views the use of story headlines and summaries by Google and others to go beyond fair use.

So scanning an entire book, without express permission? Seems like that would go well beyond the AP’s view of fair use, as well.

The Talking Points Memo has a copy of the AP internal memo, which details what happened:

They bought a copy, ripped it from its spine and scanned it into the system so it could be read and electronically searched.

This is the core of what Google Book Search does, except it doesn’t have to rip books from their spine. But it does scan books into its system, so that it can be electronically searched. If the copyright owner hasn’t given permission, it cannot be read. (See my Search Engines, Permissions & Moving Forward In Copyright Battles post for more about the issues involved).

Ah, but Google does it for profit. The AP did it for journalism.

Well, journalism is a commercial use as well. AP members carrying those reports made money off of the ads. And Palin could argue that AP, by cherry-picking out of her book before the AP was given a review copy and permission, has leeched off her content and devalued the news value in it in the same way the AP complains others do to its stories. Folks pushing that “hot news” laws need to be updated, take note of a news case study, how the AP dealt with Palin’s book.

Meanwhile out in Connecticut, the bloggers-ripoff-newspapers story takes a new twist. Actually, an old one — newspapers accused of ripping off other newspapers. The Journal Inquirer has sued the Hartford Courant of plagiarism. Writes the New York Times:

Online, The Courant credited many if not all of the articles to the original newspapers, Richard P. Weinstein, The Journal Inquirer’s lawyer, said. But in print, the attribution was often dropped, and the byline of a Courant writer was added. The articles were rearranged and rewritten to some extent, but some phrases from the originals remained intact.

Postscript: David Weir has a comment from AP here, where it explains that the book (apparently only one was purchased) was scanned so that AP staffers in bureaus such as Washington and Alaska could read relevant sections, ending that it wasn’t scanned “for public consumption.”

To clarify against Google Book Search, the public cannot “consume” a scanned book, unless a rightsholder has given explicit permission for that book to be shown online. That’s the only way you can read extended passages or complete works, or, if a book is in the public domain. Otherwise, as best, all you see are small quotations from within the book that match your search term.

It sounds like the AP took the single book and made it possible for multiple people to read extended sections, if not the entire thing, within the organization. Whether it’s fair use to reprint a book like this because your a journalism organization is beyond my knowledge.