Tweet
With such an architecture, tagging an image with metadata locally using a vision model (e.g. @MoondreamAI), so that it is searchable, filterable and usable in XR, becomes trivial. It's just a single additional (optional) step in the cascading conversion process: https://x.com/utopiah/status/1857378905872609572
(original)
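A rough sketch of the idea, not the actual implementation: the cascade is modeled here as a list of steps applied in order, and tagging is just one more step that can be left out. Every name below (`Asset`, `pdf_to_image`, `make_tagging_step`, `run_cascade`, `fake_describe`) is hypothetical, and `fake_describe` only stands in for a call to a local vision model; it does not use any real Moondream API.

```python
from dataclasses import dataclass, field
from typing import Callable, List

@dataclass
class Asset:
    """Hypothetical record for a file moving through the cascade."""
    name: str
    kind: str                      # e.g. "pdf" or "image"
    tags: List[str] = field(default_factory=list)

Step = Callable[[Asset], Asset]

def pdf_to_image(asset: Asset) -> Asset:
    """Example conversion step: only acts on PDFs, passes anything else through."""
    if asset.kind != "pdf":
        return asset
    stem = asset.name.rsplit(".", 1)[0]
    return Asset(name=stem + ".png", kind="image", tags=list(asset.tags))

def make_tagging_step(describe: Callable[[Asset], List[str]]) -> Step:
    """Build the optional tagging step around any function that returns tags."""
    def tag(asset: Asset) -> Asset:
        if asset.kind != "image":
            return asset
        return Asset(asset.name, asset.kind, asset.tags + describe(asset))
    return tag

def run_cascade(asset: Asset, steps: List[Step]) -> Asset:
    """Apply each step in order; each step decides whether it applies."""
    for step in steps:
        asset = step(asset)
    return asset

def fake_describe(asset: Asset) -> List[str]:
    """Placeholder for a local vision model call (assumption, returns fixed tags)."""
    return ["diagram", "whiteboard"]

source = Asset("notes.pdf", "pdf")
without_tags = run_cascade(source, [pdf_to_image])
with_tags = run_cascade(source, [pdf_to_image, make_tagging_step(fake_describe)])
```

The two runs differ only by the extra entry in the step list, which is the sense in which tagging is "a single additional (optional) step". The resulting `tags` list is what a search or filter in XR would then read.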