GRAB uses a graph neural network (GNN) encoder to convert relational tables into latent tokens for frozen LLMs, achieving significant performance gains in multi-table question answering.
Surflo is a feed-forward 3D reconstruction model that compresses unposed RGB views into latent tokens and decodes consistent 3D surface points via flow matching. This enables variable-resolution output, and the model runs faster than existing methods.