Forgot your password?
typodupeerror

Submission + - Introducing...book2screenplay from Crankshaft News from Shyamal Chandra (google.com)

advancecoder writes: Abstract:
We present book2screenplay, an end-to-end system that transforms a PDF book into (1) a production-formatted screenplay rendered with the screenplay.cls LaTeX class and (2) a narrated 4K picture-video with per-scene imagery, multi-voice text-to-speech, and optional rich caption overlays. The pipeline is implemented in Rust with tokio-based concurrency and orchestrates a hierarchy of local large language model (LLM) agents served by multiple Ollama instances. A Director agent produces a story bible and beat structure; parallel Writer agents draft Fountain-structured scenes; Continuity and Polish passes reconcile tone and formatting; image and speech stages materialise the screenplay for audiovisual output. We report architectural findings from development on a 96 GB Apple Silicon Mac Studio: memory is dominated not by the Rust orchestrator but by concurrent model residency and 4K ffmpeg encodes; explicit model eviction and KV-cache quantisation reduce peak resident set from 70–90 GB to 48 GB without sacrificing throughput on a two-server Ollama pool. We document failure modes—JSON truncation, schema drift in model output, and optional-filter absence in minimal ffmpeg builds—and the mitigations adopted (context budgeting, lenient deserialisation, capability probing). The system demonstrates that fully local, agentic book-to-film pre-visualisation is technically feasible for feature-length targets, though page-count fidelity and cross-scene continuity remain open research problems.

More details coming soon!

Submission + - Introducing...Agentic Parallel Book Writer from Domino Data Systems. (google.com)

advancecoder writes: Abstract:
Generating a coherent full-length novel (300 printed pages, 90,000–300,000 words) from a single prompt is well beyond the effective context window of every publicly available large language model in 2024–2026. We present AGENTICBOOKWRITER, an industrial-grade system written in Rust that achieves this by decomposing the problem into a four-tier hierarchy of specialized agents—a Director, a Planner, a pool of Writers, and a pool of Reviewers—each implemented as a prompt template against a pool of locally hosted Ollama instances [Ollama contributors, 2024, Gerganov and contributors, 2023]. A shared “canon” data structure (synopsis, world bible, character sheets, scene beats) is serialized to a single JSON file that doubles as an atomic, scene-granular checkpoint, making a multi-hour run crash-safe and resumable. We engineer aggressive parallelism (up to 16 concurrent writer requests on a 28-core Apple Silicon Mac Studio) through a port-sharded pool of Ollama servers, guarded by a per-request num_ctx auto-sizer and a transient-error retry classifier that neutralizes the known GGML_ASSERT KV-cache bug in llama.cpp. On llama-3.1:8b we generate a readable 90,000-word first draft in 48–80 minutes wall-clock; on a 300,000-word target the same pipeline completes in 3–4 hours with a graceful fallback that has never, in our measurements, lost a chapter to a model crash. We contribute: (i) a canon-plus-fallback prompting protocol that tolerates small-model schema drift, (ii) a tier-specific concurrency bound that keeps the KV-cache stable under long prompts, (iii) scene-granular resume semantics, and (iv) a reproducible single-binary Rust implementation.

More details coming soon!

Comment Questions for GNU Kind Communications Guidelines (Score 1) 448

On another note, here are my thirty questions due to the loaded, lossy, and opinionated terminology dictating the GNU Kind Communication Guidelines (https://www.gnu.org/philosophy/kind-communication.html). Please let me know if there is any confusion because I am totally confused and as a user and modder of GNU software in the past, present, and future, I feel it is not only my obligation but my duty to respond to this post on the gnu.org website. 1. What do you mean by advance? 2. What do you certain patterns of communication mean? 3. What do you mean by conscious effort? 4. What do you mean by specific? 5. What do you mean by good faith? 6. What do you mean by wrongs? 7. What do you mean by harsh tone? 8. What do you mean by accept it? 9. What do you mean by actually say and actually do? 10. What do you mean by preferences? 11. What do you mean by personal attacks? 12. What do you mean by hit back? 13. What do you mean by feelings? 14. What do you mean by private? 15. What do you mean by peace? 16. What do you mean by anger? 17. What do you mean by typical desires, capabilities, or actions? 18. What do you mean by kind? 19. What do you mean by conscientious? 20. What do you mean by mistakes? 21. What do you mean by nonfree? 22. What do you mean by not allowed? 23. What do you mean by exaggerations? 24. What do you mean by constructive criticism? 25. What do you mean by real views? 26. What do you mean by tangent? 27. Why not have a special hotline for GNU software w.r.t. that repository? 28. What do you mean by comfortable? 29. What do you mean by cater? 30. What do you mean by friendlier?

Comment Questions for Malcolm Gladwell! (Score 4, Interesting) 111

Ten-thousand hours (~3.4 years if a regular job) is Gladwell's estimate of temporal mastery. With that being said, the Mozarts like Carlsen or Fischer learn faster and become World Champion. What is the difference between the Mozarts and 3.4 years? Is it there some passionate rage to absorb and decipher patterns that magnetizes them to a particular domain or is it their consistent, well-designed regimen for reaching the upper echelons (like Lalzo Polgar's systemic approach with Judith and Susan)? If it is "pure" passion, then will people who find their true calling and invest appropriate time (e.g. have an OCD mentality) always see the unquestionable results? If it is "pure" regimen, then will following the same systematic approach always see the overarching performance? One thing to keep in mind is are these skills transferable to other domains? Is there a way to tackle a number of domains in the same 10,000 hours with an abstract approach? What about the time to create "new" domains rather than to "solve" problems in a particular domain? Is there some sort of estimate for that? Malcolm could possibly use those clues for his sequel to "Outliers" appropriately called "Pioneers". Any thoughts?

Slashdot Top Deals

1 Dog Pound = 16 oz. of Alpo

Working...