It gets worse :/
I looked up the brand (Invenda). Their PDF includes “using AI”, “measuring foot traffic”, and gathering “gender/age/etc” e.g. facial recognition to estimate a persons age and gender
And in terms of “stored locally” this is straight from their website
The machine comes with a “brain” – Invenda OS – and is connected to the Invenda Cloud, which allows you to manage it remotely and gather valuable environmental, consumer and transactional data. The device can be branded according to your requirements to further enhance your brand presence.
The marketing also so fricken backwards that it reads like satire:
For a consumer, there’s no greater comfort than shopping pressure-free. Invenda Wallet allows consumers to browse, select and pay for products leisurely and privately 🤦♂️
Unfortunately its got the hardware for it https://www.invendagroup.com/vending-machines
Its not really local only either, the cameras exist for the point of data harvesting, just look at their marketing. They only mean they’re not streaming video to a server for recognition. The after-recognition data is still sent to a server https://www.invendagroup.com/vending-machines
Go to the vendors website, and not only see hole for yourself, but look at them claiming to use AI to detect demographics.
PDF Link: https://a.storyblok.com/f/184550/x/e7435c019e/brochure-svm_generic-dark-netflix-ui.pdf
Their website: https://www.invendagroup.com/vending-machines
Here’s my own screenshots in case the vendor takes their pdf down from bad press.
Hole: https://photos.app.goo.gl/rTk8fUWynmXgw7Zc6
AI Features: https://photos.app.goo.gl/RiF1Phrroj65tMdd7
If we serve licensed content over ssh or HTTPS it’s still licensed. Protocols don’t change the legal requirements of the data. Warner Bros will still sue if one of their movies is hosted on a server using the activity pub protocol.
What? I’m saying every federated copy must legally must have the usage restrictions. Just cause it’s copied doesn’t mean it can go into a for-profit LLM.
it can apply across all of them, for example that’s how copy-left works
Sure, but it’s still true that there are legal protections we can add that make it not fair game for Lemmy. At best it would be unfair-game (illegal scraping of Lemmy)
It’s not fair game for for-profit bussinesses training LLM’s. That’s part of why Reddit made the move; so that companies would need to pay Reddit for access to the data for legally training models
Yeah, sorry if I’m not great at communicating. That’s exactly what I’m trying to point out when I said:
Even if we don’t federate with them, Meta can still harvest the data so we should add these protections regardless.
As opposed to a facebook-controlled server being the top search result for Lemmy.
I see why that’s confusing so I edited my comment just now
I think we can give facebook/threads the bad end of the bargin IF we have a data protections.
You know how powerful copy-left was for open source? I think we can do the same for Lemmy servers. We can have users agree (formally) that the data on a particular server cannot be used for training llvm’s advertisements, marketing profiles, etc, and make it legally binding.
Even if we don’t federate with them, Meta can still harvest the data so we should add these protections regardless. Maybe there is already something like this and I’m just unaware of it.
If we do add these protections and we ensure that the largest instance (e.g. Lemmy.world) is community controlled, I think it could work well for bringing more content to Lemmy.
I feel dumb for having to ask but what exactly is “active users half year” vs “active uses monthly”?
Is half year just mean one or more comments/upvotes in the last 6months?
Actually, that’s my point we CAN’T rank them.
It is totally impractical for us to correctly/effectively rank the absolute torrent of posts coming from reddit, and the result is that every high quality 1000-reddit-upvote post is surrounded by an ocean of straightup-spam 1-reddit-upvote posts.
Real Lemmy posts in a community are completely drowned out by bot posts. I can’t even find real users posting in a community because there’s so many bot posts.
I feel like any reddit that asks questions should not have a Lemmy bot. Like I just saw a bot for r/whatisthis and I just can’t understand the logic. Who would EVER answer a question when the OP can’t even see the response. I didn’t want to rant about it, but really, how can we get in contact with the bot creators???
Meanwhile, something as simple as calvin and hobbs clips are being manually posted on a daily schedule by a regular Lemmy user.
Not even just trolls, I have yet to come across a clearly-bad post that wasn’t already downvoted to oblivion, or a clearly-good post that had a negative total. And the csam response? Straight up world-class defense system faster than any megacorp could’ve scrambled together.
Lemmy users are anything but passive when it comes to trash showing up in the feed.
I’ve been tracking the version search issue on nixpkgs for the last 3 years. Your site is a major win on that front!
I still use Lazamar’s tool regularly, so I’m thrilled to see a successor to it.
Devbox (a framework) has the most exhaustive version search for nix that I know of, but it’s a bit of a pain because they assume you use devbox to install the package (instead of using nix to install the package)
I mean yeah, but I do like knowing who and in what way.
Pease consider the opposite; if a fork is needed at some point in the future, we need people who are familiar with the codebase. It would be, for example, much better for 3 of 4 contributors to be sane than only 2 of 3.
What really bothers me is the “measuring foot traffic”. I already refuse to use vending-machines because of the pricing and unhealthyness, but you’re telling me I need to make GDPR takedown requests just for walking to class?