I have never been able to stand more than 5 minutes of a MOOC video before telling myself 'OK, I'll find a proper textbook.'.
I usually have a basic view of the MOOC topic ; at least the textbook allows me to skim it and dig deeper on the points that I'm interested in.
Just sitting at my desk and watching a video is usually boring and requires to watch the complete segment before realising it was not what I was looking for.
The same goes for all these video tutorials : why bother making a 5-min youtube video on some software installation when a one-page text with command lines would be appropriate?
The real problem is inappropriate use of media.
Video, text, pictures/photos all are appropriate, depending on what you're doing.
A picture or illustration is ideal if you need to show something visual - like a map. Doing so in words makes text verbose and inevitably, unclear. (Remember the saying a picture is worth a thousand words?). however, that doesn't mean you simply put up a gallery with no explanatory text. Pictures don't convey actions, but they help illustrate. A tutorial showing screenshots with explanatory steps between each screenshot is way more useful than a tutorial without (even ASCII art is still an illustration), or a tutorial consisting of just pictures.
Likewise, video has its place. It's use is to show people an action. Not action as in "click this button", but complex actions that cannot be shown by a mere image and explanatory text. Perhaps a part is particularly hard to remove and requires a tricky amount of manipulation - text and illustrations help, but it should ALSO be supplemented with a video showing it visually. Note: you need ALL THREE methods for this - video alone is insufficient. Text with video isn't sufficient either (you cannot follow the text and video simultaneously), and video and pictures isn't sufficient either (see text and images).
Oh, and no, videos must be properly produced with proper sound tracks and narration. And even more importantly, angles must be the same between the illustration and video. Sorry, but if the only place you can film from is upside down, then you should show your illustrations from that POV as well so users can correlate the illustration with the action. If alternative POVs are more appropriate, you must illustrate the POV shown in the video so users do not waste the entire time trying to figure out the POV.
Text is descriptive. It tells what to do.
Illustrations, photos, pictures are visual. They show things at static points in time and are useful tor pointing out, illustrating actions, or providing visual information.
Videos show action that cannot be captured by mere text nor illustrations. Perhaps a location is tricky to get to - a drawing can show you where it is, but it can't show you how to get there or what you should see. (Not without a lot of shots sequentially taken, that is). Oh, and videos must be long enough to show a lead in (how you got there), the action, and a lead out. Jumping right into the action is bad when viewers are trying to orient themselves at first. If you're showing a jumping action to reach a hidden room, you need to start from a familiar room, pause there for a couple of seconds to let viewers get bearings by showing landmarks, then proceed.
Truth is, you need all three media to teach or do anything effectively. The real problem is too much over-reliance of one medium or another - typically too much video, not enough text nor illustrations. No media is perfect - they all have their shortcomings and one cannot be substituted for another.
Text is always a must. If anything it's the backbone of the whole thing and it serves to tie the other media together to form the cohesive unit of what you're trying to communicate. You cannot use the other media without text. Next level up is illustrations, pictures, photos, and other static displays. Again, text must tie together the visual display - the visual cannot exist on its own. And a visual is not a substitute for text - if you're describing a series of steps or a location, it has to be written out, then called out on the visual.
Finally, comes the video, which should be used sparingly and appropriately. It's inappropriate to show the whole thing as a video when most of the steps are static (meaning static graphics are more ideal). Complex actions are suitable for video as long as properly decorated.
Of course, this is VERY HARD WORK. Most people are lazy which is why we get the crap we do. It takes real work to put together something appropriate and it cannot be slap-dashed together in a few hours.