Well, I also wondered what Amazon's Cloud was doing in the drone example (see this figure from the cited article). It turns out it does the voice recognition (apparently with Amazon's "Alexa" service).
BTW, the drone article (I didn't read the babyphone one) gives step-by-step instructions on how to set up the different programs. That could also be useful to others who want to use speech recognition for whatever. Although, given the example phrases in the article, such as "Alexa talk to Drone", "Command Launch", and "Go forward 10 feet" (especially the last one), I wonder whether Alexa can handle a grammar or whether one has to define a separate command for each number of feet to move.
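To make the question concrete, here is a minimal Python sketch of what I mean by "grammar": one rule with a numeric slot that covers every distance, instead of one hard-coded command per distance. The pattern, the direction words, and the function name `parse_move` are my own assumptions for illustration; none of this comes from the article or from Alexa's actual interface.

```python
import re

# Hypothetical rule (my assumption, not from the article or Alexa):
# a single pattern with a numeric slot handles any number of feet.
MOVE = re.compile(
    r"^go (forward|back|left|right) (\d+) (?:feet|foot)$",
    re.IGNORECASE,
)

def parse_move(phrase):
    """Return (direction, feet) if the phrase matches the rule, else None."""
    match = MOVE.match(phrase.strip())
    if match is None:
        return None
    return match.group(1).lower(), int(match.group(2))
```

With something like this, "Go forward 10 feet" and "Go forward 37 feet" would both go through the same rule, and only the number would differ. Whether the Alexa setup in the article works this way is exactly what I could not tell from the example phrases.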