Skip to main content

Document Metadata Reference


Decision‑Making Insights

  • Structural consistency: Verify total_sections and review chapters to ensure logical chapter breaks.
  • Narrative flow: Chart aggregated_sentiment and top_topics to understand theme and tone progression.
  • Readability targets: Compare avg_readability to your intended audience’s reading level.
  • Genre alignment: Use genre and global_top_keywords to posit~~ion your manuscript in the right market.
  • Marketing hooks: Leverage global_top_keywords and top_topics for blurbs, tags, and promotional copy.

How each analytics tile can guide your edits

UI tile / metric blockWhat it revealsHow to use it to polish the draft
Bag of Words / Phrases (top single-words & bigrams)Over-used lexis and recurring images.Replace the most common verbs/nouns with fresher synonyms, trim filler (“back”, “like”), or lean into a deliberate motif if repetition is intentional.
Sentiment gauge + bubblesOverall emotional temperature (neg/neu/pos) and polarity balance.If the gauge is stuck on the negative side, seed in lighter beats or hopeful reactions to give readers moments to breathe. Conversely, dial up tension if the story feels too neutral.
Word Count / Est. Pages / Read-TimeRaw size and pacing expectations.Compare to genre norms: if your YA chapter runs 9 000 words (≈30 min), consider tightening scenes or splitting chapters for younger audiences.
Readability (Flesch–Kincaid, Fog, SMOG)Sentence complexity & estimated U.S. grade level.
Flesch–Kincaid = grade level based on sentence length and syllables.
Gunning Fog = grade level based on sentence length and complex words.
SMOG = grade level based solely on polysyllabic words.
Aim for 4–6 for middle-grade, 6–9 for YA, and 10+ only for academic or literary prose. Lower scores mean more accessible reading. Adjust by simplifying sentence structure and vocabulary.
Lexical metrics (TTR, Hapax, Avg Word Len)Vocabulary richness and variety.
TTR ≥ 0.5 = dense variety; < 0.25 = repetitive.
Hapax shows one-off words count.
• Longer avg word-len hints at more advanced diction.
Boost TTR/Hapax by pruning echoes and adding sensory specifics; lower them when a simpler voice is desired.
Dialog Ratio% of dialogue lines vs narrative exposition.Heavy narration (< 30 % dialogue) may feel slow—inject conversations. If > 60 %, add grounding action and interiority so scenes don’t float.
Top Topics (add-on)Theme clusters extracted via LDA/BERTopic.Verify that dominant topics match your intended themes; amplify or trim scenes to keep focus.
NER / Beats / Tropes / AI metric (add-on)Named-entity consistency, plot-beat coverage, trope tagging, and overall “engagement” score.Catch name misspellings, ensure required beats occur at genre-expected positions, and target trope usage for marketing blurbs.
Sentiment Trend (sparkline per chapter)Emotional arc across the manuscript.Check for flatlines: long stretches of identical sentiment can dull tension. Vary peaks/valleys to sustain reader engagement.
Word Trends (per-chapter counts)Pace & scene density.Sudden spikes may signal info-dumps; valleys might indicate scenes too sparse—rebalance for rhythm.

Quick example (excerpt)

{
"readability": { "flesch_kincaid": 4.8, "gunning_fog": 7.2, "smog": 8.1 },
"lexical_metrics": { "ttr": 0.31, "hapax": 523, "avg_word_length": 4.43 },
"dialog_ratio": 0.54
}

Interpretation:

  • F-K 4.8 → approachable for middle-grade readers.
  • TTR 0.31 → healthy variety; no urgent action.
  • Dialog 54 % → dialogue-driven; sprinkle more scene texture if pacing feels rushed.

Interpretation — Document-level Word & Sentiment Trends

  • Word sparkline
    Tall spikes mark chapters > 2 SD above mean word count—candidates for trimming or splitting.
    Deep valleys highlight under-developed scenes that may need fleshing out.

  • Average line (grey)
    Reveals drift: if late chapters all hover well above the line, pacing is inflating—rebalance earlier sections.

  • Sentiment bars

    • Green = % positive sentences* Red = % negative.
      Large red stacks warn of sustained gloom; aim for varied stacks to keep emotional movement.
  • Compound line
    –100 → +100 single-number polarity summary per chapter.
    • While the bars show how much of each polarity is present, compound blends them into one value (≈ weighted pos – neg). A chapter with modest bars can still have a strong +/– compound if one side dominates.
    • Use it to spot overall vibe at a glance, then inspect bar mix for nuance.

  • Smoothed net curve
    A gentle roller-coaster = healthy emotional rhythm. A long plateau (± 5 %) signals monotony—add a reversal or comic relief.

info

Advanced analytics—Top Topics, Named-Entity consistency, Trope & Beat detection, and AI-powered engagement scoring—are available as an add-on at checkout. Unlock them when you need deeper structural or marketing insights.

Use these panels as a revision checklist: tackle readability and repetition first (highest return for effort), refine sentiment flow next, and finish with advanced add-ons for final polish.

Aggregated insights at the manuscript level

FieldTypeDescription
total_sectionsusizeNumber of sections/chapters analyzed.
total_wordsusizeSum of all word counts.
total_paragraphsusizeSum of all paragraph counts.
total_charsusizeSum of all character counts.
avg_read_time_minf64Average reading time per section (in minutes).
global_term_freqsHashMap<String, usize>Top N words by frequency across the document.
global_bigramsHashMap<String, usize>Top N two-word phrases across the document.
global_top_keywordsHashMap<String, f64>Top N TF–IDF keywords for the entire manuscript.
aggregated_sentimentSentimentMean sentiment (neg/neu/pos/compound) averaged over sections.
top_topicsVec<String>Most frequent topics across all sections.
genreOption<String>Majority-genre label based on section genres.
avg_sentence_lengthf64Mean sentence length across sections.
avg_ttrf64Mean type-token ratio across sections.
avg_hapaxf64Mean hapax legomena count per section.
avg_word_lengthf64Mean average word length per section.
avg_readabilityReadabilityMetricsMean readability scores (Flesch–Kincaid, Gunning Fog, SMOG) across sections.
avg_dialog_ratiof64Mean dialogue ratio (proportion of words in dialogue) averaged over sections.
authorOption<String>First detected author (if any) from the opening sections.
chaptersVec<ChapterInfo>List of each chapter’s info: ChapterInfo { title, subtitle, words, sentiment, dialog_ratio }.