• wizardbeard@lemmy.dbzer0.com
      link
      fedilink
      English
      arrow-up
      14
      ·
      4 months ago

      On the highest level, they have a constant firehose of as much audio data from a sea of customers as they wish.

      Send it to cheap overseas transcribers, use it to train and improve voice recognition and automatic transcription.

      Have a backchannel to television viewing and music listening patterns.

      Know when different customers are home or not, improving demographics data.

      Know what is discussed within the house for data on ad penetration/reach, brand awareness, and better advertisement targeting.

      It’s not a direct data to money pipeline, but having an always on listening device in someone’s home nets you a ton of useful data as an online retailer and advertiser.

      • Emily (she/her)@lemmy.blahaj.zone
        link
        fedilink
        arrow-up
        12
        ·
        4 months ago

        having an always on listening device in someone’s home

        They very explicitly do not collect audio when you haven’t used a wake word or activated it some other way. They will not “know what is discussed within the house for data on ad penetration/reach” (which is pretty much the only valuable data you’ve mentioned here), nor will they “have a backchannel to television viewing and music listening patterns” unless you actively discuss it with your device.

        I’m not going to put words in your mouth, but if whoever reads this is thinking of replying “are you going to trust that” etc, yes I am. We can track which data an Alexa transmits in real time and directly verify this “always listening” isn’t happening. Even if we couldn’t independently verify that his is the case, and lets say they contradict their privacy policy and public statements and do it anyway, that’s a crazy liability nightmare. Amazon has more than enough lawyers to know that unconsentually recording someone and using that data is very illegal in most places, and would open them up to so many lawsuits if they accidentally leaked or mishandled the data. Take the conspiracy hat off and put your thinking cap on.

        Send it to cheap overseas transcribers, use it to train and improve voice recognition and automatic transcription.

        Bad for privacy, but also not a $25 billion dollar source of revenue.

        Alexa, Google Home, and Siri devices are not good sources of data. If they were, why would Google, king of kings when it comes to data collection, be cutting their Assistant teams so much?

          • Emily (she/her)@lemmy.blahaj.zone
            link
            fedilink
            arrow-up
            6
            ·
            edit-2
            4 months ago

            It’s a good thing their reason is explained very clearly in the article linked in this post. They believed Alexa would have a high “downstream impact”, i.e.generate sales or subscriptions elsewhere in the company. Which it has so far failed to do.

              • Emily (she/her)@lemmy.blahaj.zone
                link
                fedilink
                arrow-up
                6
                ·
                4 months ago

                Can you explain to me exactly how moving where profit is recorded from one division to another in the same organization reduces their tax burden? Because, excuse me, I know I only did a year or two of accounting courses before dropping the degree, but that’s not how I understand taxes to work.

                Also to be turning a profit by “doing well collecting data”, the open market value of the data Alexa alone annually generates would need to be around 8% of the entire global data market. If you can justify how millions of instances of “Alexa set a timer for 10 minutes”, “Alexa what is the weather”, or “Alexa play despacito” generates that much value, maybe you have a point.

          • Emily (she/her)@lemmy.blahaj.zone
            link
            fedilink
            arrow-up
            3
            ·
            4 months ago

            Read the next paragraph, I already addressed you armchair conspiracy theoriests. We can independent verify their claims by analysing the device’s network traffic, I’ve literally done it myself and seen with my own eyes that it doesn’t happen. If you don’t believe me, you can also check for yourself.

              • Emily (she/her)@lemmy.blahaj.zone
                link
                fedilink
                arrow-up
                3
                ·
                edit-2
                4 months ago

                How much computing power do you think it takes to approximately recognise a predefined word or phrase? They do that locally, on device, and then stream whatever audio follows to more powerful computers in AWS (the cloud). To get ahead of whatever conspiratorial crap you’re about to say next, Alexa devices are not powerful enough to transcribe arbitrary speech.

                Again, to repeat, people smarter than you and me have analysed the network traffic from Alexa devices and independently verified that it is not streaming audio (or transcripts) unless it has heard something close (i.e close enough such that the fairly primative audio processing (which is primitive because it’s cheap, not for conspiracy reasons) recognises it) to the wake word. I have also observed this, albeit with less rigorous methodology. You can check this yourself, why don’t you do that and verify for yourself whether this conspiracy holds up?

                  • Gaywallet (they/it)@beehaw.org
                    link
                    fedilink
                    arrow-up
                    4
                    ·
                    4 months ago

                    I’m going to treat you with good faith and assume you were using “cool man” in the same way someone might say “that’s just like your opinion man”, as a saying, but I will remind you that this person has their pronouns in their display name and you need to respect them.