Earlier this year the Internet Archive began culling over 14 million images from their public domain ebooks, then began uploading them to the Internet Archive’s Flickr account. This means that all of the historic images are now easily searchable and downloadable, something that wasn’t really possible before without downloading each ebook and finding the images yourself, then exporting them.
The ebooks are easily searchable already thanks to the Optical Character Recognition software which was used when adding them to the archive, but it didn’t work for images. Now with the help of Flickr, those looking for historic text and images can get the best of both worlds.
“The software also copied the caption for each image and the text from the paragraphs immediately preceding and following it in the book,” said The Internet Archives Communications Technology Scholar Kavel Leetaru when speaking with the BBC.
The software used isn’t perfect, so admittedly some of the tags on images will be imprecise, but to have such a vast library of easily searchable content is great for learning purposes and the team are hoping that libraries around the world will one day follow suit and digitize their books and images.
Thank you Arstechnica for providing us with this information.
Image courtesy of Arstechnica.
Electronic Arts (EA) announced today that its games were played for over 11 billion hours…
Steam's annual end-of-year recap, Steam Replay, provides fascinating insights into gamer habits by comparing individual…
GSC GameWorld released a major title update for STALKER 2 this seeking, bringing the game…
Without any formal announcement, Intel appears to have revealed its new Core 200H series processors…
Ubisoft is not having the best of times, but despite recent flops, the company still…
If you haven’t started playing STALKER 2: Heart of Chornobyl yet, now might be the…