Discussion about this post

User's avatar
Neural Foundry's avatar

This is exactly the kind of analysis the space needs right now. The part about multimodal being a "where does the work live" question is sharper than most people realize, becuase training on synthetic benchmarks wont capture the messy reality of how actuall documents get structured. What's intresting is that Google's ecosystem advantage might not just be about acces to data but about understanding the implicit workflows around that data. If you've trained on millions of real Sheets-plus-Slides combinations, you're not just seeing formats you're seeing intent.

Expand full comment

No posts

Ready for more?