Abstract: This paper introduces the CAT model, achieving 98.9% accuracy on the ShipsEar dataset by combining CNN and ViT with a fusion and separation mechanism for underwater acoustic signal ...