IMLJD: A Computational Dataset for Indian Matrimonial Litigation Analysis
Summary
The paper introduces IMLJD, a computational dataset designed for analyzing Indian matrimonial litigation, supporting natural language processing and legal analytics research.
# IMLJD: A Computational Dataset for Indian Matrimonial Litigation Analysis
Source: [https://arxiv.org/abs/2605.19346](https://arxiv.org/abs/2605.19346)
Similar Articles
@tom_doerr: Curated list of instruction and reasoning datasets for LLMs https://github.com/mlabonne/llm-datasets…
A curated list of instruction and reasoning datasets for LLMs, compiled by mlabonne, with details on dataset characteristics, licenses, and use cases.
LAUKIN: A Multi-jurisdictional Common Law Contract Dataset
Introduces LAUKIN, a dataset of clause pairs from Australian, UK, and Indian contracts labeled for legal equivalence, and evaluates 12 models achieving 65.11% macro-F1, establishing a challenging benchmark.
RTI-Bench: A Structured Dataset for Indian Right-to-Information Decision Analysis
Introduces RTI-Bench, a structured dataset for analyzing decisions under India's Right to Information Act, useful for NLP and legal AI research.
BIASEDTALES-ML: A Multilingual Dataset for Analyzing Narrative Attribute Distributions in LLM-Generated Stories
Researchers introduce BIASEDTALES-ML, a large-scale multilingual dataset of ~350,000 LLM-generated children's stories across eight languages, designed to analyze narrative attribute distributions and cross-lingual bias patterns in language model outputs. The work reveals significant cross-lingual variability, highlighting limitations of English-centric bias evaluations.
We’ve been analyzing how people are using LLMs for legal and compliance tasks (GDPR, AI Act, etc.).
Analysis of LLM usage in legal and compliance tasks reveals that models often produce confident but unverifiable citations, raising questions about reliable legal grounding for AI outputs.