Comment Re:Mathematician commentary included (Score 1) 81
do llms perform logical inference
I put your query into Gemini. It said:
The short answer is yes, but not in the way a traditional computer program or a human logician does. LLMs can successfully solve complex logic puzzles, deduce conclusions from premises, and write flawless mathematical proofs. However, because they are built on probabilistic text prediction rather than an explicit "logic engine," their reasoning capabilities have fascinating quirks and strict limitations.
It might give a different answer because of memories or just plain randomness. Or it might be that you actually had a longer conversation and pushed it your way. Those models love to please. I hope you didn't just try to get it to agree with you.
Then I asked "How is their logical inference similar to humans, particularly when they are allowed to use reasoning techniques and tools?", and it said:
When we give an LLM the space to "think" (via extended reasoning chains) and the ability to interact with tools (like code interpreters, calculators, or search engines), its process begins to look remarkably similar to human dual-process cognition. Psychologists often map human intelligence into two systems: System 1 (fast, intuitive, subconscious, and prone to cognitive biases) and System 2 (slow, deliberate, analytical, and rule-following). By default, a standard LLM operates purely like System 1: it spits out the next word based on intuition and pattern matching. But when we equip them with reasoning techniques and tools, they effectively simulate a System 2 mind.
I would guess you are using these models the wrong way. Ask it to clarify issues and define terms. Push against what it says. I see a lot of people using these things as echo chambers to try to win an argument. They don't have the background to understand the topic, so they just skim the answer and keep querying until they get what they are looking for. You can actually learn what generalization means in ML if you just ask the model. That knowledge will give you the ability to say something interesting.