Follow Slashdot blog updates by subscribing to our blog RSS feed

 



Forgot your password?
typodupeerror

Submission Summary: 0 pending, 18 declined, 2 accepted (20 total, 10.00% accepted)

Submission + - The Small World of English (inotherwords.app)

michaeldouma writes: We built a 1.5M word semantic network where any two words connect in ~6.43 hops (76% connect in 7). The hard part wasn't the graph theory—it was getting rich, non-obvious associations. GPT-4's associations were painfully generic: "coffee beverage, caffeine, morning." But we discovered LLMs excel at validation, not generation. Our solution: Mine Library of Congress classifications (648k of them, representing 125 years of human categorization). "Coffee" appears in 2,542 different book classifications—from "Coffee trade—Labor—Guatemala" to "Coffee rust disease—Hawaii." Each classification became a focused prompt for generating domain-specific associations. Then we inverted the index: which classifications contain both "algorithm" and "fractals"? Turns out: "Mathematics in art" and "Algorithmic composition." This revealed connections like algorithmFibonaccigolden ratio that pure co-occurrence or word vectors miss. The "Montreal Effect" nearly tanked the project—geographic contamination where "bagels" spuriously linked to "Expo 67" because Montreal is famous for bagels. We used LLMs to filter true semantic relationships from geographic coincidence. Technical details: 80M API calls, superconnector deprecation (inverse document frequency variant), morphological deduplication. Built for a word game but the dataset has broader applications.

Submission + - Semantic Word Games: From 1960s Origins to AI Tools (inotherwords.app)

michaeldouma writes: Games exploring word associations remain rare compared to spelling-focused word games. While Connections, Semantle, Codenames, and Taboo have broken through, there's little holistic examination of this sub-genre.

I've cataloged a dozen major and minor semantic games, from Borgmann's 1967 synonym chains to modern AI-powered implementations using word vectors and neural embeddings. The collection includes commercial hits, research experiments, and two games I developed.

The most fascinating aspect: virtually any two English words can connect through conceptual "stepping stones" in 7 steps or fewer. Our implementation maps 1.1 million words with 60 million weighted connections, revealing the hidden structure of language itself.

Slashdot Top Deals

Any circuit design must contain at least one part which is obsolete, two parts which are unobtainable, and three parts which are still under development.

Working...