Research

Pushing the frontier of reasoning, code, and Chinese-language AI.

We work on agentic coding, multimodal evaluation, reasoning training, low-resource NLP, and model auditing, writing each result up as a bilingual explainer for researchers and general readers alike.


7 papers · 6 research themes · Updated June 2026

Milestones

  1. 2026: Five research lines released
    • SafeGEO · GEO risk evaluation for recommendation agents
    • SWE-Bench Mobile · KDD 2026
    • ThinkTwice · self-refinement RLVR
    • Grounded Chess Reasoning · Master Distillation
    • OasisSimp · low-resource simplification dataset
  2. August 2025: SEAM accepted at COLM 2025
  3. December 2024: Report Cards receives NeurIPS SoLaR Spotlight