Forgot your password?
typodupeerror

Submission Summary: 2 pending, 1 declined, 0 accepted (3 total, 0.00% accepted)

Submission + - Introducing...book2screenplay from Crankshaft News from Shyamal Chandra (google.com)

advancecoder writes: Abstract:
We present book2screenplay, an end-to-end system that transforms a PDF book into (1) a production-formatted screenplay rendered with the screenplay.cls LaTeX class and (2) a narrated 4K picture-video with per-scene imagery, multi-voice text-to-speech, and optional rich caption overlays. The pipeline is implemented in Rust with tokio-based concurrency and orchestrates a hierarchy of local large language model (LLM) agents served by multiple Ollama instances. A Director agent produces a story bible and beat structure; parallel Writer agents draft Fountain-structured scenes; Continuity and Polish passes reconcile tone and formatting; image and speech stages materialise the screenplay for audiovisual output. We report architectural findings from development on a 96 GB Apple Silicon Mac Studio: memory is dominated not by the Rust orchestrator but by concurrent model residency and 4K ffmpeg encodes; explicit model eviction and KV-cache quantisation reduce peak resident set from 70–90 GB to 48 GB without sacrificing throughput on a two-server Ollama pool. We document failure modes—JSON truncation, schema drift in model output, and optional-filter absence in minimal ffmpeg builds—and the mitigations adopted (context budgeting, lenient deserialisation, capability probing). The system demonstrates that fully local, agentic book-to-film pre-visualisation is technically feasible for feature-length targets, though page-count fidelity and cross-scene continuity remain open research problems.

More details coming soon!

Submission + - Introducing...Agentic Parallel Book Writer from Domino Data Systems. (google.com)

advancecoder writes: Abstract:
Generating a coherent full-length novel (300 printed pages, 90,000–300,000 words) from a single prompt is well beyond the effective context window of every publicly available large language model in 2024–2026. We present AGENTICBOOKWRITER, an industrial-grade system written in Rust that achieves this by decomposing the problem into a four-tier hierarchy of specialized agents—a Director, a Planner, a pool of Writers, and a pool of Reviewers—each implemented as a prompt template against a pool of locally hosted Ollama instances [Ollama contributors, 2024, Gerganov and contributors, 2023]. A shared “canon” data structure (synopsis, world bible, character sheets, scene beats) is serialized to a single JSON file that doubles as an atomic, scene-granular checkpoint, making a multi-hour run crash-safe and resumable. We engineer aggressive parallelism (up to 16 concurrent writer requests on a 28-core Apple Silicon Mac Studio) through a port-sharded pool of Ollama servers, guarded by a per-request num_ctx auto-sizer and a transient-error retry classifier that neutralizes the known GGML_ASSERT KV-cache bug in llama.cpp. On llama-3.1:8b we generate a readable 90,000-word first draft in 48–80 minutes wall-clock; on a 300,000-word target the same pipeline completes in 3–4 hours with a graceful fallback that has never, in our measurements, lost a chapter to a model crash. We contribute: (i) a canon-plus-fallback prompting protocol that tolerates small-model schema drift, (ii) a tier-specific concurrency bound that keeps the KV-cache stable under long prompts, (iii) scene-granular resume semantics, and (iv) a reproducible single-binary Rust implementation.

More details coming soon!

Slashdot Top Deals

"A child is a person who can't understand why someone would give away a perfectly good kitten." -- Doug Larson

Working...