Tweet

March 10, 2024 at 3:47:44 PM

Not perfect but getting meaningful results. It’s also quite fast so I’ll next try to the STT and OCR as inputs, non-stop.

For context the (poorly chunked) 1100 pages comes back as ~2MB of embeddings. Querying that takes .1s https://x.com/utopiah/status/1766356919516627328