How many AIs does it take to read a PDF?

Image: Kristen Radtke / The Verge

Last November, the House Oversight Committee had just released 20,000 pages of documents from the estate of Jeffrey Epstein, and Luke Igel and some friends were clicking around, trying to follow the threads of conversation through garbled email threads and a PDF viewer that was, frankly, “gross.” In the coming months, the Department of Justice would release its own batches of files, more than three million of them – again, all PDFs.

This was a problem. While the Department of Justice had run optical character recognition over the text, it was not very good, Igel said, rendering the files more or less unsearchable.

“There was no interface …

Read the full story at The Verge.

Read more @ TheVerge

Latest posts

DJI will pay $30K to the man who accidentally hacked 7,000 Romo robovacs

The DJI Romo robot vacuums. | Image: DJI On Valentine's Day, I brought you a story that's since made headlines all around the world: How...

The Trump administration says it can’t process tariff refunds because of computer problems

The US Customs and Border Protection says it currently can't comply with an order to process billions of dollars in refunds stemming from tariffs...