3d audio = surround sound (5.1/7.1/8.1/etc)
"5.1/7.1/8.1" doesn't have an elevation component. Certain IMAX formats did, as did some experimental 70mm formats in the 70s, but it hasn't really been widely available before Dolby ATMOS and Barco Auro.
The big difference with the traditional X.Y formats is these regard individual screen channels as discrete, and when films are mixed, sound sources are hard-assigned to certain speaker channels, and the speaker placement has to be matched in every venue . "3D" systems use procedural methods to assign sound sources a vector or coordinate with metadata, and a decoder at the receiving end does the job of assigning speakers, which may have different placement and number from venue to venue.
Something mixed in 5.1 or 7.1 can be "downmixed" to stereo by summing channels together and applying pan and gain to position the multichannel sources in a stereo field. But a stereo signal can't really be "upmixed" to a 7.1, the position of individual sound sources is lost and can't really be extracted from the mix -- there are fancy ways of "spatializing" stereo mixes to 5.1 or 7.1 with fourier analysis and panning certain phase correlations or frequencies to different speakers, but there's really no way for a spatializer to split the celli from the violas and pan them separately, or the machine guns and the explosions.
3D audio formats keep violas and cellis on separate streams in the file, and then use position metadata to do the speaker mix in the receiver, so something mixed on stereo or 5.1 speakers could be unmixed to a 7.1, or 11.1, or 64 channel setup and you would actually get more fidelity.