Multivariate Probability Models in Machine Learning [D]
Summary
A discussion thread on multivariate probability models in machine learning.
Similar Articles
What is the point of MoE models, beyond being faster?
A discussion about the advantages of Mixture of Experts (MoE) models over dense models beyond speed, considering RAM constraints and scaling limits.
Get in here: Community model build thread
A thread proposing a method for creating a community AI model using crowdsourced compute via Branch-Train-Stitch to build a Mixture-of-Experts model from independently trained submodels, with discussion of hardware requirements, participant involvement, and technical challenges.
Semantic DLM+: Improving Diffusion Language Models through Bias-variance Trade-off in Transition Kernel Design
This paper theoretically analyzes diffusion language models through a bias-variance lens, identifying trade-offs between masking and uniform diffusion kernels. It proposes SemDLM+, which adds a global transition and semantic-frequency penalty to overcome the semantic basin problem, achieving competitive generation quality on LM1B and OpenWebText benchmarks.
Anomalies in Multivariate Time Series Benchmarks Are Mostly Univariate
This paper introduces a diagnostic framework for multivariate time series anomaly detection benchmarks and finds that labeled anomalies are mostly detectable from individual channels, challenging the need for cross-channel modeling. The authors call for more structurally diverse evaluation sets.
@SilvioMartinico: The late-interaction multivector retrieval ecosystem is exploding right now. To help separate the signal from the noise…
A curated list of top models, engines, libraries, and datasets for late-interaction multivector retrieval, organized in an 'Awesome Multivector Retrieval' resource.