Show HN: Context-aware Japanese furigana using Sudachi and ModernBERT
Summary
EZFurigana is a free, privacy-focused tool that uses Sudachi and ModernBERT to add context-aware furigana to Japanese text, supporting various input formats and customization options.
View Cached Full Text
Cached at: 05/29/26, 04:20 PM
Similar Articles
Show HN: A working reference implementation of context engineering
A working reference implementation of context engineering — a discipline for designing, retrieving, and injecting organizational context into AI systems to produce accurate, domain-specific outputs. The repo demonstrates five components (corpus, retrieval, injection, output, enforcement) running against Amazon Bedrock with Claude.
JFinTEB: Japanese Financial Text Embedding Benchmark
JFinTEB introduces the first comprehensive benchmark for evaluating Japanese financial text embeddings, addressing a gap in domain-specific and language-specific evaluation resources. The benchmark includes retrieval and classification tasks evaluated across Japanese-specific, multilingual, and commercial embedding models, with datasets and evaluation framework publicly released.
ContextRAG: Extraction-Free Hierarchical Graph Construction for Retrieval-Augmented Generation
ContextRAG introduces an extraction-free method for constructing hierarchical graph indices for retrieval-augmented generation, using Residual-Quantization K-Means and Formal Concept Analysis to reduce LLM calls and tokens by orders of magnitude while maintaining competitive F1 scores on multi-hop questions.
HKUDS/RAG-Anything
HKUDS released RAG-Anything, an open-source all-in-one multimodal retrieval-augmented generation framework based on LightRAG.
@HuggingPapers: NVIDIA just released an optimized version of the Kokoro TTS model on Hugging Face A lightweight 82M parameter speech sy…
NVIDIA released an optimized ONNX version of the Kokoro TTS model on Hugging Face. The 82M parameter model is lightweight, fast, and ready for commercial use.