CAD$99

100k Story Prompts Synthetic LLM Training Data

🧠 100,000 MYTHOPUNK STORY PROMPTS — High-Fidelity Multiverse Dataset for LLMs, Writers, and Narrative Engines

📦 Dataset ID: MP-100K | Format: text | Structured for AI & Creative Development

✨ "Not just prompts — a self-consistent mythos spanning timelines, archetypes, and symbolic recursion."

Step beyond procedural randomness and into a synthetic mythopunk continuum. This dataset contains 100,000 ultra-specific, continuity-aware story prompts, each crafted to simulate the narrative depth of a living mythology. It is optimized for use in LLM training, world generation, game design, and multi-agent storytelling environments.


🔍 What's Inside?

Each structured prompt includes:

  • Protagonist Identity: Character variants like Isolde Fairfax, Priya Wintermoor, Nyx Fleshbane, Orion Solborne, and more — each with a trackable motivation arc.
  • Symbolic Motivation: Examples include “trying to erase all their footprints”, “seeking to experience death without dying”, “trying to become massless”, etc.
  • Mythic Locations: Abstract or surreal settings such as Rust Cathedral, The Veil’s Scar, The Unfinished, Blood Memory, Moon’s Call, or The Hypercube.
  • Key Artifacts: Objects of immense symbolic weight like the tornado-spiraled Noble’s Last Coin, the lie-embodied Druid’s Moonbeam, or the chance-embracing Fateweaver’s Needle.
  • Narrative Twists: Reality-bending stakes — e.g., “it contains entire worlds within its structure” or “it's made from the moment before any decision is made.”
  • Antagonists: Villains like the Chrono Leviathan, Quantum Revenant, or Final Emissary — each with distinct motives and cultural signatures.
  • Cultural Aesthetics: Embedded flavor like shadowfell echoes, banshee scream-dampening acoustics, or aasimar blessing inscriptions.
  • Fixed Timeline Anchor: Every story unfolds on September 6, across millennia (+11591619 to +23903588+), reinforcing continuity and symbolic cyclicality.

🛠️ Technical Format:

  • 100,000 fully structured entries
  • Fields:
    protagonist, motivation, location, artifact, antagonist, cultural_influence, twist, theme, timestamp, prompt_text
  • Machine-parseable, ideal for LLM pretraining, fine-tuning, RAG pipelines, procedural storytelling engines, or search-based prompt chaining.
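
A minimal sketch of how an entry with the fields above might be validated and turned into a training example. The ten field names come from the list above; everything else is an assumption made for illustration: the listing only says the format is "text", so the JSON Lines layout, the `StoryPrompt` class, and the `parse_record` and `to_training_pair` helpers are hypothetical and may need adapting to the actual file.

```python
import json
from dataclasses import dataclass, fields


@dataclass
class StoryPrompt:
    # Field names as listed on this page; all values are kept as strings.
    protagonist: str
    motivation: str
    location: str
    artifact: str
    antagonist: str
    cultural_influence: str
    twist: str
    theme: str
    timestamp: str
    prompt_text: str


FIELD_NAMES = [f.name for f in fields(StoryPrompt)]


def parse_record(line: str) -> StoryPrompt:
    """Parse one entry, ASSUMING one JSON object per line (not confirmed by the listing)."""
    raw = json.loads(line)
    missing = [name for name in FIELD_NAMES if name not in raw]
    if missing:
        raise ValueError(f"record is missing fields: {missing}")
    return StoryPrompt(**{name: str(raw[name]) for name in FIELD_NAMES})


def to_training_pair(record: StoryPrompt) -> dict:
    """Build a hypothetical fine-tuning pair: structured fields in, prompt text out."""
    structured = "\n".join(
        f"{name}: {getattr(record, name)}"
        for name in FIELD_NAMES
        if name != "prompt_text"
    )
    return {"input": structured, "output": record.prompt_text}
```

Calling `to_training_pair(parse_record(line))` on each line of the file would yield input/output dictionaries that can be written back out for a fine-tuning job. Keeping the structured fields separate from `prompt_text` is one design choice among several; the same records could equally be indexed field by field for a RAG pipeline.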

🧠 Built For:

  • LLM Developers: Craft models with richer story synthesis, thematic recursion, and symbolic reasoning.
  • Game & Narrative Designers: Generate mythic quests, unique NPC motivations, artifact lore, and campaign arcs with continuity.
  • Writers & Worldbuilders: Jumpstart creative ideation, map interconnected mythologies, or build serialized fiction from modular concepts.
  • Researchers: Explore how neural models interpret structured abstraction, motif repetition, and cultural drift over time.

🗝️ Why This Dataset?

  • Not “random prompts” — this is a cohesive symbolic simulation across time and worlds.
  • Themes evolve, characters recur, timelines expand.
  • Custom-built for emergent storytelling, symbolic reasoning, and meta-narrative structure.

🧬 “Imagine if Joseph Campbell, Borges, and DALL·E trained a universe together — this is the dataset that feeds it.”


📥 Instant Download

  • ✅ 100k records
  • ✅ Commercial use license
    Copyright © C.J. Jones, 2025

    By purchasing this dataset, you are granted a non-exclusive, worldwide, perpetual license to use, modify, and distribute the dataset and its derivatives for commercial and non-commercial purposes, subject to the terms below:

    ✅ You May:

    • Use the dataset in commercial applications, products, models, and research.
    • Modify, adapt, and build upon the dataset for any use.
    • Redistribute derived works, models, or outputs generated using the dataset.

    ❌ You May Not:

    • Resell, redistribute, or repackage the raw dataset itself in its original form.
    • Claim exclusive ownership or authorship of the dataset.
    • Use the dataset for unlawful, harmful, or deceptive purposes.

    💬 Attribution (Optional):

    Attribution is appreciated but not required.

    Note: This dataset is being sold to fund the development of a 500M-1B, fully transparent GPT community model. All data used in the model will eventually be made open to the public with its release.

    ⚠️ Disclaimer:

    The dataset is provided "as is", without warranties or guarantees of any kind. The creator assumes no liability for any direct or indirect damages resulting from the use of the dataset.

This dataset provides 100,000 richly structured, continuity-aware story prompts designed to enhance an LLM’s ability to handle symbolic reasoning, long-term narrative structure, character motivation, abstract locations, and mythic artifacts. Each entry follows a consistent schema with recurring characters, evolving themes, embedded twists, and culturally distinct antagonists—perfect for training models in storytelling, world coherence, narrative abstraction, and generative depth. Ideal for fine-tuning LLMs to generate, analyze, or extend high-concept fiction across thousands of interconnected timelines.

Size
83.6 MB