I've noticed that many multimodal sentiment analysis datasets (such as CMU-MOSI) use COVAREP to extract their audio features (74 dimensions). However, I'm not familiar with MATLAB. Is there a way to get the same features as COVAREP using Python?