Can Self-Supervised Neural Representations Pre-Trained on Human Speech distinguish Animal Callers?