Sperm whales communicate with each other using rhythmic sequences of clicks, called codas. It was previously thought that sperm whales had just 21 coda types. However, after studying almost 9,000 recordings, the Ceti researchers identified 156 distinct codas. They also noticed the basic building blocks of these codas which they describe as a “sperm whale phonetic alphabet” – much like phonemes, the units of sound in human language which combine to form words.

Pratyusha Sharma, a PhD student at MIT and lead author of the study, describes the “fine-grain changes” in vocalisations the AI identified. Each coda consists of between three and 40 rapid-fire clicks. The sperm whales were found to vary the overall speed, or the “tempo”, of the codas, as well as to speed up and slow down during the delivery of a coda, in other words, making it “rubato”. Sometimes they added an extra click at the end of a coda, akin, says Sharma, to “ornamentation” in music. These subtle variations, she says, suggest sperm whale vocalisations could carry a much richer amount of information than previously thought.

  • Zaktor@sopuli.xyz
    link
    fedilink
    English
    arrow-up
    4
    ·
    edit-2
    25 days ago

    I’m in “artificial intelligence” and write software. I’m not looking for a product name. If they developed an entirely new technique that’d also be cool to mention, but they probably built on existing techniques and were at least working in some broad form of AI.

    Edit: After looking at the paper, it’s the “misused” category. They were doing regular scientific data analysis.

    • Match!!@pawb.social
      link
      fedilink
      English
      arrow-up
      2
      ·
      edit-2
      25 days ago

      The github shows them doing some k nearest neighbors and kernel estimation as part of their understanding of the coda (I think? I don’t know anything about whales)

      • Zaktor@sopuli.xyz
        link
        fedilink
        English
        arrow-up
        2
        ·
        25 days ago

        The KMeans is actually an unused import, but yeah, I see the kernel estimation. They also use a gaussian mixture in one of the lower cells. So a little AI.