This is impossible for an AI, or even a human, to do. An AI will never be able to tell just from a photo whether your glass has Coke or Diet Coke, whether your dressing is full-fat or light, whether that soup was made with cream or milk, or whether your cupcake is made from fortified flour or a gluten-free, unfortified alternative. Thinking it can is just a pipe dream. But AI is hot, so they've got to put AI in everything. I saw a post for an AI-ready screen protector for a phone the other day. WTF?
What they could do is have the AI count how many main dishes and sides are in a shot, make general guesses as to what they are, then prompt the user for the details. It might save a bit of time on the user's part, but probably not enough to be worth the time, effort, and inference cost of doing it.
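
For what it's worth, here is a rough Python sketch of that "guess, then ask" flow. Everything in it is made up for illustration: `detect_items` is a stand-in that returns canned guesses instead of calling a real vision model, and `Guess`, `FOLLOW_UPS`, `build_prompts`, and `summarize` are hypothetical names, not anything from an actual app.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Guess:
    label: str  # the model's general guess, e.g. "soup"
    role: str   # "main" or "side"
    questions: List[str] = field(default_factory=list)


# Hypothetical lookup of details a photo cannot reveal, keyed by guessed label.
FOLLOW_UPS = {
    "soda": ["Regular or diet?"],
    "salad": ["Full-fat or light dressing?"],
    "soup": ["Made with cream or milk?"],
    "cupcake": ["Fortified flour or a gluten-free, unfortified alternative?"],
}


def detect_items(photo_path: str) -> List[Guess]:
    """Stand-in for a vision model (assumption): returns canned guesses."""
    return [Guess("soup", "main"), Guess("salad", "side"), Guess("soda", "side")]


def build_prompts(guesses: List[Guess]) -> List[Guess]:
    """Attach the follow-up questions the user still has to answer."""
    for g in guesses:
        g.questions = FOLLOW_UPS.get(g.label, ["What is this, and how was it made?"])
    return guesses


def summarize(guesses: List[Guess]) -> str:
    """Count mains and sides, as the model would report them."""
    mains = sum(1 for g in guesses if g.role == "main")
    sides = sum(1 for g in guesses if g.role == "side")
    return f"{mains} main, {sides} side"


if __name__ == "__main__":
    items = build_prompts(detect_items("lunch.jpg"))
    print(summarize(items))
    for item in items:
        print(f"Looks like {item.label} ({item.role}): {' '.join(item.questions)}")
```

Note that the user still ends up answering one question per item, which is exactly why the time saved is probably small.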