Product

  • Features
  • Pricing
  • Browser Extension

Resources

  • News
  • Documentation
  • Blog
  • Quick Start
  • FAQ

Company

  • About
  • Contact

Legal

  • Terms of Service
  • Privacy Policy
  • Refund Policy
  • DMCA

© 2026 TeaNovel Translate

DiscordRSS Feed

"The best translation is the one that makes you forget you are reading a translation."

TeaNovel
  • News
  • Pricing
BlogTechnology

How NoveLM Works: The AI Engine Behind Genre-Aware Novel Translation

Inside NoveLM: 16 genre styles, 7-category NER with gender inference, 5-dimension quality scoring. How TeaNovel's AI translation engine works.

TT
TeaNovel Team
Apr 30, 202614 min read
TT
TeaNovel Team
Apr 30, 202614 min read
On this page
  • The Problem NoveLM Solves
  • Layer 1: Genre Routing — 16 Styles, 10 Aliases
  • Layer 2: Named Entity Recognition — 7 Categories with Gender Inference
  • Gender Inference
  • Entity Persistence
  • Layer 3: Genre-Specific Translation
  • What Genre-Specific Prompting Looks Like in Practice
  • Layer 4: Quality Scoring — 5 Dimensions
  • Why Quality Scoring Matters
  • Before vs. After: What NoveLM Changes
  • The Numbers
  • What This Means for Your Reading Experience
  • Frequently Asked Questions
  • Is NoveLM the same as ChatGPT or GPT-4?
  • Which genre style produces the best translations?
  • How does NoveLM handle novels that mix genres?
  • What happens if the detected genre is wrong?
  • Does quality scoring slow down translation?

Every AI translation tool runs on large language models. ChatGPT, DeepL, Google Translate — they all use neural networks trained on massive datasets to convert text from one language to another. The models are impressive. The results, for general text, are often excellent.

But novel translation is not general text. A 500-chapter xianxia novel contains thousands of recurring proper nouns that must be translated identically every time they appear. It uses genre-specific registers that shift between formal cultivation prose, casual modern dialogue, and classical wuxia cadence depending on the scene. And it demands quality transparency — a reader who does not speak Chinese needs to know whether a translation is trustworthy before investing hours reading it.

General-purpose translation handles none of this. NoveLM was built to handle all of it.

This article explains how NoveLM works — not at the code level, but at the level that matters to readers: what it does differently, why those differences exist, and how they affect the English text you read.

The Problem NoveLM Solves

Take a passage from a typical xianxia novel:

萧炎的斗气如同烈焰般在经脉中奔涌,九转玄功的第三层终于突破。他猛然睁开双眼,周身的气势骤然攀升,周围的空气都仿佛凝固了一般。

A general-purpose translator processes this as a standalone text block. It has no information about what genre this passage belongs to, no memory of how "斗气" was translated in the previous 200 chapters, no knowledge of whether "萧炎" is male or female, and no way to evaluate whether its own output is any good.

NoveLM processes the same passage through four layers:

  1. Genre routing — identifies this as xianxia content and selects the appropriate translation style
  2. Named entity recognition — identifies "萧炎" as a character (male) and "九转玄功" as a skill, applying consistent translations
  3. Translation with genre-specific prompting — generates English prose tuned for xianxia register and conventions
  4. Quality scoring — evaluates the output across five dimensions and assigns a transparent score

Each layer solves a problem that general translation ignores. Together, they produce output that reads like genre fiction rather than machine translation.

Layer 1: Genre Routing — 16 Styles, 10 Aliases

When a novel enters NoveLM's pipeline, the first decision is genre classification. NoveLM supports 16 genre-specific translation styles:

GenreRegisterExample Adaptation
XianxiaFormal, classical-inflected"Dou Qi," "tribulation," "meridians"
XuanhuanEpic fantasy, dramatic"Essence power," "divine realm"
WuxiaClassical martial, elegant"Sword qi," "internal force," "jianghu"
RomanceConversational, emotionally textured"Heart raced," natural dialogue rhythm
HistoricalFormal period proseCourt titles, period-appropriate phrasing
AncientArchaic, ceremonialClassical sentence structures
UrbanModern casual, groundedContemporary references, slang-adjacent
ModernClean contemporaryNeutral modern English
FantasyWestern fantasy-inflected"Mana," "enchantment," familiar fantasy terms
Sci-fiTechnical, preciseScientific terminology, measured cadence
HorrorTense, atmosphericShort sentences, sensory language
ComedyLight, rhythmicTiming-aware phrasing, punchline preservation
MysteryControlled, deliberateInformation pacing, clue-conscious word choice
Slice of LifeWarm, observationalEveryday language, gentle pacing
ThrillerClipped, urgentShort paragraphs, tension-building
MilitaryFunctional, authoritativeRank-specific language, tactical precision

The system also recognizes 10 genre aliases to handle common variations in how novels are tagged:

  • "cultivation" → xianxia
  • "martial arts" → wuxia
  • "science fiction" → sci-fi
  • "scifi" → sci-fi
  • "suspense" → mystery
  • "contemporary" → modern
  • "light-hearted" → comedy
  • "isekai" → fantasy
  • "reincarnation" → fantasy
  • "transmigration" → fantasy

Genre routing is not cosmetic. It changes the fundamental character of the translation output. The same Chinese sentence translated in xianxia style and modern style will produce noticeably different English — different vocabulary, different sentence structure, different rhythm. Here is a concrete example:

Original: 他凝聚全身真气,一掌击出。

Xianxia style: He gathered his True Qi throughout his body and struck with a single palm.

Modern style: He concentrated all his energy and threw a palm strike.

Wuxia style: He condensed his inner force to its peak and unleashed a palm strike.

Same meaning. Different register, different terminology, different reading experience. The xianxia reader expects "True Qi." The modern fiction reader expects "energy." The wuxia reader expects "inner force." NoveLM delivers the right term for the right genre.

Layer 2: Named Entity Recognition — 7 Categories with Gender Inference

The second layer runs before translation begins. NoveLM's NER system scans the source text and identifies entities across 7 categories:

CategoryExamplesWhy It Matters
Character萧炎 → Xiao YanNames must be consistent across 500+ chapters
Location天云山 → Heavenly Cloud MountainReaders need to recognize recurring locations
Organization天云宗 → Heavenly Cloud SectSects, guilds, and factions appear in every chapter
Skill九转玄功 → Nine Revolutions Mysterious ArtTechnique names are genre-critical terminology
Item火莲瓶 → Fire Lotus BottleArtifacts and items drive plot points
Title大长老 → Grand ElderRank and title consistency affects power hierarchy clarity
Race妖族 → Demon RaceSpecies and race names must be stable

Gender Inference

Chinese third-person pronouns present a unique challenge for translation. In spoken Chinese, 他 (he), 她 (she), and 它 (it) are pronounced identically (tā). In older or informal writing, 他 is often used generically regardless of gender. This means a translator — human or AI — must infer gender from context.

NoveLM performs gender inference using a weighted voting system across 6 basis types:

  1. Honorifics — "小姐" (miss), "公子" (young master), "娘子" (lady)
  2. Pronoun patterns — frequency of 她 vs 他 in surrounding text
  3. Name conventions — certain character components skew male or female
  4. Role descriptions — "师姐" (senior sister), "师兄" (senior brother)
  5. Contextual relationships — romantic pairings, family roles
  6. Explicit statements — authorial description of gender

Each basis type casts a weighted vote. When multiple signals agree, the system assigns gender with high confidence. When signals conflict, the system marks the character as ambiguous and uses neutral constructions until more evidence accumulates.

The result: pronouns are correct in the vast majority of cases, including for characters whose gender is not immediately obvious. This is a problem that general translation tools consistently fail at because they lack any mechanism for cross-chapter entity tracking.

Entity Persistence

Once an entity mapping is established — "萧炎 = Xiao Yan (male, character)" — it persists through every subsequent chapter. Chapter 1 and chapter 500 use the same translation for the same entity. There is no drift, no variation, and no need for the reader to figure out whether "Xiao Yan" and "Shaw Yan" are the same person.

This is the single most impactful difference between NoveLM and general translation tools. A long web novel may introduce 200+ named entities. Without persistent tracking, every entity is an opportunity for inconsistency. With tracking, the reader experiences one coherent story instead of a patchwork of disconnected chapters.

Layer 3: Genre-Specific Translation

With genre routing and NER data in place, the actual translation step begins. NoveLM generates English text using genre-specific prompting that incorporates:

  • Style profile — the register, vocabulary constraints, and sentence structure preferences for the detected genre
  • Entity map — all recognized entities with their established English translations and metadata (category, gender)
  • Cultural reference handling — genre-specific decisions about when to transliterate (keep the original term) vs. translate (convert to English equivalent)

The cultural reference decision is worth examining. In xianxia fiction, "斗气" is better kept as "Dou Qi" because the xianxia reading community uses that term universally. Translating it to "fighting spirit" is technically accurate but strips away genre identity. In a modern urban novel, the same concept might be better rendered as "fighting energy" because the modern fiction reader does not have the same genre-specific vocabulary expectations.

NoveLM makes these decisions systematically based on genre. A xianxia translation preserves genre-standard transliterations. A modern translation favors natural English equivalents. A wuxia translation uses classical martial arts terminology. These are not random choices — they are the accumulated conventions of decades of genre translation, encoded into the translation pipeline.

What Genre-Specific Prompting Looks Like in Practice

Here is the same passage rendered in three different genre styles:

Original: 她轻轻握住他的手,心中涌起一股暖意。这个世界再冷,有他在身边就足够了。

Romance style:

She gently took his hand, warmth flooding through her chest. The world could be as cold as it wanted — having him beside her was enough.

Xianxia style:

She lightly clasped his hand, a surge of warmth rising in her heart. No matter how cold this world grew, his presence at her side was sufficient.

Modern style:

She held his hand softly, a warm feeling welling up inside her. Even if the whole world turned cold, having him here was all she needed.

The romance style prioritizes emotional immediacy — "warmth flooding through her chest" is visceral and intimate. The xianxia style uses slightly formal phrasing — "his presence at her side was sufficient" carries the dignity of cultivation fiction. The modern style lands in between — natural, warm, but not heightened. All three are accurate. Each reads like it belongs in its genre.

Layer 4: Quality Scoring — 5 Dimensions

After translation, NoveLM evaluates its own output across five dimensions with weighted scoring:

DimensionWeightWhat It Measures
Accuracy30%Does the translation faithfully convey the original meaning?
Fluency25%Does the English read naturally and grammatically?
Style20%Does the translation match the genre's register and tone?
Terminology15%Are proper nouns and genre terms translated consistently?
Format10%Are paragraphs, dialogue, and structural elements preserved?

Scores follow a 100-point rubric:

Score RangeRatingInterpretation
90-100ExceptionalPublication-quality translation
75-89GoodMinor issues, fully readable
60-74AcceptableNoticeable but non-critical issues
40-59PoorSignificant problems, may require review
20-39Very PoorMajor inaccuracies or readability issues
0-19UntranslatedTranslation failed or was not attempted

Why Quality Scoring Matters

Quality scoring serves three purposes:

Reader confidence. A chapter that scores 94 can be read without concern. A chapter that scores 68 on Terminology signals that some proper nouns may have been handled incorrectly — and the reader can review those terms alongside the chapter in the wider TeaNovel workflow.

Translation transparency. No AI translation is perfect. The question is whether the tool acknowledges its limitations or hides them. NoveLM scores every chapter visibly. If a chapter struggled — perhaps because the source text contained unusual dialect, code-switching between literary and vernacular Chinese, or ambiguous pronouns — the score reflects that. The reader knows before investing time.

Continuous improvement signal. Scoring patterns reveal which types of content translate well and which do not. Chapters heavy in classical Chinese poetry consistently score lower on Fluency because poetic forms resist natural English prose. Chapters with extensive dialogue consistently score higher because conversational Chinese maps well to conversational English. These patterns inform pipeline improvements over time.

Before vs. After: What NoveLM Changes

To make the four layers concrete, here is what happens when the same passage goes through NoveLM versus a general translation pipeline:

Original (xianxia scene):

萧炎的斗气如同烈焰般在经脉中奔涌,九转玄功的第三层终于突破。他猛然睁开双眼,周身的气势骤然攀升,周围的空气都仿佛凝固了一般。

General translation (no genre routing, no NER, no scoring):

Xiao Yan's fighting spirit surged through his meridians like a blazing fire. The third level of the Nine Turns Mysterious Technique finally achieved a breakthrough. He suddenly opened his eyes wide, and the momentum around his body sharply increased. The air around him seemed to solidify.

NoveLM (xianxia style, NER active, scored 93):

Xiao Yan's Dou Qi surged through his meridians like raging flames as the third layer of the Nine Revolutions Mysterious Art finally broke through. His eyes snapped open, his aura climbing sharply, and the very air around him seemed to freeze solid.

What changed:

DimensionGeneral TranslationNoveLM
Genre terminology"fighting spirit""Dou Qi" (genre standard)
Skill name"Nine Turns Mysterious Technique" (varies)"Nine Revolutions Mysterious Art" (locked by NER)
Prose structure4 short sentences2 flowing compound sentences
RegisterNeutral informationalDramatic xianxia cadence
Quality scoreNone (unknown confidence)93 (Exceptional)

The general translation is not wrong. Every word is defensible. But it reads like a translation. The NoveLM output reads like xianxia fiction — because the genre routing selected xianxia register, the NER locked skill names, and the style profile shaped sentence rhythm to match genre expectations.

The Numbers

Every claim in this article is derived from NoveLM's actual codebase:

  • 16 genre-specific translation styles
  • 10 genre alias mappings
  • 7 NER entity categories (character, location, organization, skill, item, title, race)
  • 6 gender inference basis types with weighted voting
  • 5 quality scoring dimensions (Accuracy 30%, Fluency 25%, Style 20%, Terminology 15%, Format 10%)
  • 6 scoring tiers (Exceptional 90-100, Good 75-89, Acceptable 60-74, Poor 40-59, Very Poor 20-39, Untranslated 0-19)
  • 4 supported source sites via browser extension (Qidian, JJWXC, QDMM, Fanqie)

These are not marketing numbers. They are the actual parameters of the system that translates your chapters. Every genre style exists as a distinct configuration. Every NER category has dedicated extraction logic. Every quality dimension has a defined weight and rubric.

What This Means for Your Reading Experience

NoveLM's four layers work together to solve a problem that general translation tools do not even attempt: making a 500-chapter Chinese web novel read like a coherent, genre-appropriate English novel from start to finish.

Genre routing means your xianxia novel reads like xianxia — not like a business email or a Wikipedia article. NER means "Xiao Yan" is always "Xiao Yan" and never "Shaw Yan" or "Little Flame." Quality scoring means you know which chapters to trust and which to double-check. And the immersive reader means you consume all of this in an interface designed for fiction, not for reviewing translation output.

No translation tool is perfect. NoveLM will not produce output identical to what a professional human translator would write. But it produces output that respects the genre, maintains consistency, scores its own confidence, and improves chapter by chapter as the NER database grows.

Try it at read.teanovel.com — 1,000 free credits per month, enough to see the difference between general translation and genre-aware translation for yourself.

Frequently Asked Questions

Is NoveLM the same as ChatGPT or GPT-4?

No. NoveLM is TeaNovel's translation pipeline, not a single language model. It includes genre routing, named entity recognition, genre-specific prompting, and quality scoring layers built on top of AI models accessed through the Vercel AI Gateway. The layers that make NoveLM different — 16 genre styles, 7-category NER, 5-dimension scoring — do not exist in any raw LLM.

Which genre style produces the best translations?

It depends on the source material. Genre routing works best when the style matches the content. The most dramatic improvements over general translation appear in xianxia, wuxia, and romance — genres with strong community-standard terminology and distinct registers. Modern and urban novels show smaller but still meaningful improvements in natural phrasing.

How does NoveLM handle novels that mix genres?

Genre routing is applied at the novel level based on the primary genre tag. For novels that blend genres — a xianxia novel with romance subplots, or an urban novel with cultivation elements — the primary genre style handles the majority of content well. NoveLM does not currently switch styles mid-chapter, though individual passages naturally adapt because the underlying model adjusts to conversational vs. narrative context.

What happens if the detected genre is wrong?

Current public workflow uses source-platform metadata and automatic genre routing. There is no public manual genre-override UI in the extension today.

Does quality scoring slow down translation?

Quality scoring adds minimal processing time to each chapter. It runs as a post-translation evaluation step and does not affect the translation itself. The score is generated and displayed alongside the translated chapter in the reader — there is no additional wait for the reader.

←Back to Blog

On this page

  • The Problem NoveLM Solves
  • Layer 1: Genre Routing — 16 Styles, 10 Aliases
  • Layer 2: Named Entity Recognition — 7 Categories with Gender Inference
  • Gender Inference
  • Entity Persistence
  • Layer 3: Genre-Specific Translation
  • What Genre-Specific Prompting Looks Like in Practice
  • Layer 4: Quality Scoring — 5 Dimensions
  • Why Quality Scoring Matters
  • Before vs. After: What NoveLM Changes
  • The Numbers
  • What This Means for Your Reading Experience
  • Frequently Asked Questions
  • Is NoveLM the same as ChatGPT or GPT-4?
  • Which genre style produces the best translations?
  • How does NoveLM handle novels that mix genres?
  • What happens if the detected genre is wrong?
  • Does quality scoring slow down translation?

Read Next

May 13, 202611 min read

How to Find Chinese Web Novels in English: 5 Discovery Methods That Actually Work (2026)

Apr 27, 20267 min read

How to Read Fanqie Novels in English: Free Chinese Novels Translated

Apr 26, 202610 min read

How to Read Qidian Novels in English: Complete 2026 Guide