Anthropic Launches Improved Version of Its Entry-Level LLM

Posted by BeauHD on Thursday August 10, 2023 @10:02PM from the new-and-improved dept.

Anthropic, the AI startup co-founded by ex-OpenAI execs, has released an updated version of Claude Instant, its faster, cheaper text-generating model available through an API. TechCrunch reports: The updated Claude Instant, Claude Instant 1.2, incorporates the strengths of Anthropic's recently announced flagship model, Claude 2, showing "significant" gains in areas such as math, coding, reasoning and safety, according to Anthropic. In internal testing, Claude Instant 1.2 scored 58.7% on a coding benchmark, compared to 52.8% for Claude Instant 1.1, and 86.7% on a set of math questions, versus 80.9% for Claude Instant 1.1. "Claude Instant generates longer, more structured responses and follows formatting instructions better," Anthropic writes in a blog post. "Instant 1.2 also shows improvements in quote extraction, multilingual capabilities and question answering."

Claude Instant 1.2 is also less likely to hallucinate and more resistant to jailbreaking attempts, Anthropic claims. In the context of large language models like Claude, "hallucination" is where a model generates text that's incorrect or nonsensical, while jailbreaking is a technique that uses cleverly written prompts to bypass the safety features placed on large language models by their creators. And Claude Instant 1.2 features a context window that's the same size as Claude 2's -- 100,000 tokens. Context window refers to the text the model considers before generating additional text, while tokens represent raw text (e.g. the word "fantastic" would be split into the tokens "fan," "tas" and "tic"). Claude Instant 1.2 and Claude 2 can analyze roughly 75,000 words, about the length of "The Great Gatsby." Generally speaking, models with large context windows are less likely to "forget" the content of recent conversations.
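
The token and word figures above imply a rough ratio of about 0.75 words per token (75,000 words in 100,000 tokens). As a minimal sketch of that arithmetic, assuming the ratio holds as an average and that whitespace-separated words are a fair proxy, the check below estimates whether a text would fit in a 100,000-token window. The function names are hypothetical and this is not Anthropic's tokenizer; real token counts depend on the model's actual tokenization.

```python
import math

# Figures taken from the summary: a 100,000-token window is roughly 75,000 words.
CONTEXT_WINDOW_TOKENS = 100_000
WORDS_PER_TOKEN = 75_000 / 100_000  # 0.75, an assumed average, not an exact rule


def estimate_tokens(text: str) -> int:
    """Estimate a token count from a whitespace word count (hypothetical helper)."""
    words = len(text.split())
    return math.ceil(words / WORDS_PER_TOKEN)


def fits_in_context(text: str, limit: int = CONTEXT_WINDOW_TOKENS) -> bool:
    """Return True if the estimated token count is within the window."""
    return estimate_tokens(text) <= limit
```

Under these assumptions, a 75,000-word text sits right at the limit and anything longer is estimated to overflow it.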

Yawn (Score:2, Insightful)

by Anonymous Coward writes:

Yawn

Intelligence? (Score:3)

by gizmo2199 ( 458329 ) writes: on Friday August 11, 2023 @09:27AM (#63758708)

>"'hallucination' is where a model generates text that's incorrect or nonsensical,"
Any algorithmically generated output is as good as any other to an LLM because they're not intelligent. They're procedurally generating outputs based on inputs. It's only the humans that care what the answer is.
What's amazing to me is the headlong rush by techno-utopians and Wall Street into "AI" language models that can so easily generate nonsense.

but does it have a soul? (Score:2)

by carnivore302 ( 708545 ) writes:

I guess not.
