1. This demo was likely created by an engineer or salesperson at SoundHound. A demo by a third-party journalist or reviewer without a vested interest would be more impressive.
2. The impressive speed probably won't scale to the millions of simultaneous users Siri, Google Now, and Cortana support (assuming audio is processed in the cloud, which I admittedly don't know for sure).
3. Obviously the demo uses phrases that work. I guarantee you an ordinary person will often get "Sorry, I didn't understand the question" or whatever SoundHound's equivalent is.
4. While it sounds impressive at first blush, nobody really cares how many days there are between next Tuesday and Christmas of 2025. That question is not only useless but also pretty easy to special-case in your expert system / AI logic. So how about a demo that answers the question: "How can you make a mushroom omelette without soggy mushrooms?"
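
To make point 4 concrete, here is a minimal sketch of how little logic the date question needs once the phrases have been recognized. It assumes "next Tuesday" means the first Tuesday strictly after today, and the function name is made up for illustration. This is not a claim about how SoundHound actually does it.

```python
from datetime import date, timedelta

TUESDAY = 1  # date.weekday() numbers Monday as 0, so Tuesday is 1


def days_from_next_tuesday_to_christmas(today: date, year: int = 2025) -> int:
    """Hypothetical helper: days between next Tuesday and Christmas of `year`.

    Assumption: "next Tuesday" is the first Tuesday strictly after `today`.
    """
    days_ahead = (TUESDAY - today.weekday()) % 7
    if days_ahead == 0:
        # Today is a Tuesday, so "next Tuesday" is a week from now.
        days_ahead = 7
    next_tuesday = today + timedelta(days=days_ahead)
    christmas = date(year, 12, 25)
    return (christmas - next_tuesday).days


if __name__ == "__main__":
    print(days_from_next_tuesday_to_christmas(date.today()))
```

The hard part of the demo is parsing the spoken phrases into dates, not the arithmetic, which is a few lines of standard-library code.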