korean-nlp

#korean-nlp

SSP-based construction of evaluation-annotated data for fine-grained aspect-based sentiment analysis

arXiv cs.CL ↗ · 3d ago Cached

This paper presents the construction of a Korean evaluation-annotated corpus (EVAD) for fine-grained aspect-based sentiment analysis in e-commerce reviews using Semi-Automatic Symbolic Propagation. It evaluates KoBERT and KcBERT models on the dataset, achieving high F1 scores in aspect-value pair recognition.

0 favorites 0 likes

#korean-nlp

Generating training datasets for legal chatbots in Korean

arXiv cs.CL ↗ · 3d ago Cached

This paper presents a method for generating large-scale, labeled training datasets for legal chatbots in Korean using Local Grammar Graphs, achieving 91% F1-score with a DIET classifier.

0 favorites 0 likes

#korean-nlp

Optimizing Korean-Centric LLMs via Token Pruning

arXiv cs.CL ↗ · 2026-04-20 Cached

This paper presents a systematic benchmark of token pruning—a compression technique that removes tokens and embeddings for irrelevant languages—applied to Korean-centric LLM tasks. The study evaluates popular multilingual models (Qwen3, Gemma-3, Llama-3, Aya) across different vocabulary configurations and finds that token pruning significantly improves generation stability and reduces memory footprint for domain-specific deployments.

0 favorites 0 likes

korean-nlp

SSP-based construction of evaluation-annotated data for fine-grained aspect-based sentiment analysis

Generating training datasets for legal chatbots in Korean

Optimizing Korean-Centric LLMs via Token Pruning

Submit Feedback