I've been playing with downloading epub books from Google. EPUB is a format for digital publication targeted to portable readers. That's not what I care about right now. It is cool that it uses plain old xhtml and standard image formats to represent the contents of a book. That means if you can unpack an EPUB file, which is very easy, you have access to text and images in readily consumable form.
I'm not the first to point this out. See Greg Crane What do you do with a Million Books for early thinking on the large scale implications of Google's work.
In terms of playing, here's what's fun. If you go to the G Books page for H. Chase's Catalogue of Arretine pottery from the MFA, you'll see a link to download the "EPUB" version.
Once you've downloaded that file, it's easy to unpack. I'm a Mac/Linux user. If you are too, and you like the command line, 'unzip Catalogue_of_Arretine_pottery.epub' will do the trick. Otherwise, change the extension to ".zip" and double-click on the file. I'm sure something similar will work in Windows.
Once unpacked, you have two directories, 'OEBPS' and 'META-INF'. The first is the one with all the goodies in it. Open 'OEBPS/images' and you'll see the plates from the book. Those files aren't hi-res, but better than nothing.
The text is the in 'Content-###.xml' files. These can be opened in a browser directly.
As people like Greg have noted, cool things will happen when communities, such as scholars/enthusiasts of the ancient Mediterranean world, take these files and add value to them. In the meantime, I like being able to get at the images, and to have the text on my hard-drive so its available for searching. On the Mac, Spotlight does a good job of indexing the Content files. It also indexes the compressed archives when their extensions are ".zip". It seems to ignore the ".epub" files but I bet that will change soon enough.