Tag
This study reveals that LLM text embeddings are hijacked by high-frequency tokens (e.g., periods, articles) and proposes EmbedFilter, which performs SVD on the unembedding matrix and subtracts the projection component to release true semantics, achieving zero-training-cost dimensionality reduction and retrieval efficiency gains.
This paper investigates the generalization behavior of Fourier Neural Operators and Deep Operator Networks under distribution shifts in a variable-coefficient wave equation, revealing that FNO struggles with high-frequency inputs while DeepONet shows milder degradation.