Research2026-07-01

Think in English, Answer in Korean: Efficient Adaptation of Multilingual Tool-Using Agents

Originally published byArxiv CS.AI

arXiv:2606.31648v1 Announce Type: new Abstract: We present LuckyStar 111B, a 111B-parameter hybrid reasoning model developed through a collaboration between Cohere and LG CNS for Korean-English enterprise agents under practical memory and serving constraints. The model trains from Cohere's fully...

What Happened

Cohere and LG CNS have jointly released LuckyStar 111B, a 111-billion parameter hybrid reasoning model specifically designed for Korean-English enterprise applications. The model builds on Cohere’s existing multilingual capabilities, but introduces a novel “think in English, answer in Korean” paradigm. This approach allows the model to leverage English-centric reasoning pathways—where most high-quality training data and logical structures exist—while producing final outputs in Korean, optimizing for both accuracy and linguistic fluency under practical memory and serving constraints.

The research addresses a critical bottleneck: many state-of-the-art large language models are trained predominantly on English data, making them proficient at reasoning but weak at generating accurate, context-appropriate responses in lower-resource languages like Korean. LuckyStar 111B decouples the internal reasoning process from the output language, effectively using English as an intermediate representation before translating the final answer into Korean.

Why It Matters

This work tackles a fundamental asymmetry in multilingual AI: reasoning quality and language generation quality are often at odds. Models that reason in a target language directly tend to suffer from degraded logical coherence because the training data for that language is sparser. Conversely, models that reason in English and then translate lose cultural and pragmatic nuance.

LuckyStar 111B’s hybrid approach offers a pragmatic middle ground. For enterprise agents serving Korean-speaking users—such as customer support, legal document analysis, or financial advisory systems—this could mean more reliable reasoning without sacrificing natural language quality. The 111B parameter count is also notable: it is large enough to capture complex reasoning patterns, yet constrained enough to be deployable in production environments with realistic memory budgets.

For AI practitioners, this represents a shift away from monolithic multilingual models toward modular, language-agnostic reasoning architectures. It suggests that future multilingual systems may not need to be trained from scratch for every language pair. Instead, they can reuse English-centric reasoning backbones and add lightweight language-specific output modules.

Implications for AI Practitioners

First, this validates the “language-agnostic reasoning” hypothesis: that high-level logical structures are not inherently tied to any natural language. Practitioners building multilingual applications can consider separating reasoning and generation into distinct components, potentially reducing training costs and improving cross-lingual transfer.

Second, the model’s design under practical constraints—memory and serving latency—is a reminder that real-world deployment often forces compromises between capability and efficiency. LuckyStar 111B’s architecture likely includes quantization or distillation techniques to fit within enterprise infrastructure, a lesson for teams deploying large models in production.

Finally, the collaboration between Cohere (a Western AI lab) and LG CNS (a Korean enterprise) highlights the growing importance of regional partnerships. Off-the-shelf English models rarely suffice for local markets; adaptation requires deep linguistic and cultural expertise that only local partners can provide.

Key Takeaways

LuckyStar 111B introduces a “think in English, answer in Korean” paradigm, decoupling reasoning from language generation to improve both accuracy and fluency.
The model addresses a practical enterprise need: reliable multilingual agents that work under real-world memory and latency constraints.
This approach suggests a future where multilingual AI systems reuse English-centric reasoning backbones with language-specific output modules, reducing training costs.
The Cohere–LG CNS partnership underscores the value of combining global model expertise with local linguistic and domain knowledge for effective regional adaptation.

Read Original Article on Arxiv CS.AI

arxivpapersagents