Metadata is just data about data, and that can be almost anything. For voice recordings, you could reasonably claim that the following information is metadata:
- Presence of keywords or key phrases
- Voice signatures, identifying the speakers
- Stress levels of the voices
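
To make the distinction concrete, here is a minimal sketch of how such derived information might sit next to the recording itself. The class and field names are hypothetical, chosen only for illustration, and do not reflect any real system:

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class VoiceRecordingMetadata:
    """Hypothetical record of data *about* a recording, not the audio itself."""
    keywords_found: List[str] = field(default_factory=list)       # keywords or key phrases detected
    speaker_ids: List[str] = field(default_factory=list)          # speakers matched by voice signature
    stress_levels: Dict[str, float] = field(default_factory=dict) # per-speaker stress score (assumed 0..1)


@dataclass
class VoiceRecording:
    audio: bytes                      # the actual content of the conversation
    metadata: VoiceRecordingMetadata  # everything derived from or describing it


# Illustrative values only.
recording = VoiceRecording(
    audio=b"\x00\x01",
    metadata=VoiceRecordingMetadata(
        keywords_found=["meeting"],
        speaker_ids=["speaker-a", "speaker-b"],
        stress_levels={"speaker-a": 0.2, "speaker-b": 0.7},
    ),
)
```

Note that the `metadata` part can be stored and searched without keeping `audio` at all, which is what makes a broad definition of metadata so convenient.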

Given how US agencies are gaming the legal system, they will probably claim that transcripts of conversations are not the conversations themselves and are therefore metadata.

