Updates to Consumer Terms and Privacy Policy

troad · 2025-08-29T11:56:09 1756468569

You can opt out, but the fact that it's opt-in by default and made to look like a simple T/C update prompt leaves a sour taste in my mouth. The five year retention period seems... excessive. I wonder if they've buried anything else objectionable in the new terms.

It was the kick in the pants I needed to cancel my subscription.

wzdd · 2025-08-29T12:16:08 1756469768

Everywhere else in Anthropic's interface, yes/no switches show blue when enabled and black when disabled. In the box they're showing about this change the slider shows grey in both states: visit it in preferences to see the difference! It's not just disappointing but also kind of sad that someone went to the effort to do this.

senko · 2025-08-29T12:27:04 1756470424

Just did and it behaves as expected for me in the Android app (ie. not the dark pattern you described)

BalinKing · 2025-08-29T14:24:22 1756477462

I can confirm it's grey on both sides on the website.

tln · 2025-08-29T15:41:36 1756482096

I get blue (on) / black (off) on the website. Or blue / white in light mode.

https://claude.ai/settings/data-privacy-controls

It was easy to not opt-in, I got prompted before I saw any of this.

I think they should keep the opt-in behavior past Sept 28 personally.

IAmGraydon · 2025-08-29T18:00:09 1756490409

They’re likely A/B testing the interface change, which is why people are getting inconsistent results

Aurornis · 2025-08-29T15:27:25 1756481245

It works correctly (blue on, grey off) in the iOS app. I just did it now.

riz_ · 2025-08-29T12:20:00 1756470000

This is probably because there are laws in some countries that restrict how these buttons/switches can look (think cookie banners, where sometimes there is a huge green button to accept, and a tiny greyed out text somewhere for the settings).

soulofmischief · 2025-08-29T13:58:25 1756475905

Can you provide an example?

riz_ · 2025-08-29T14:20:57 1756477257

https://www.cnil.fr/en/dark-patterns-cookie-banners-cnil-iss...

gpm · 2025-08-29T17:09:44 1756487384

This link is not even remotely close to an example of the behavior you described.

merelysounds · 2025-08-29T12:19:45 1756469985

> opt-in by default

Nitpicking: “opt in by default” doesn’t exist, it’s either “opt in”, or “opt out”; this is “opt out”. By definition an “opt out” setting is selected by default.

benterix · 2025-08-29T12:42:56 1756471376

This is not nitpicking, this is a sane reaction to someone modifying the meaning of words on the fly.

klabb3 · 2025-08-29T12:49:08 1756471748

To be fair it trips people up all the time. Even precise terminology isn't great if people misuse it. Maybe it would have been better to just use "enabled by default".

troad · 2025-08-29T14:40:58 1756478458

The original meaning of sane is "physically healthy". Its usual modern meaning is "mentally healthy". You're using it to mean "reasonable".

At which exact point is language prohibited from evolving, and why it super coincidentally the exact years you learnt it?

danans · 2025-08-29T14:49:17 1756478957

> At which exact point is language prohibited from evolving

Never?

https://en.m.wikipedia.org/wiki/Semantic_change

troad · 2025-08-29T14:52:30 1756479150

Yes, that was my point.

card_zero · 2025-08-29T17:15:49 1756487749

And here it is, evolving before your eyes: we're killing off the maladaptive mutant which was "opt-in by default". That's the evolution that is happening here.

troad · 2025-08-29T23:41:41 1756510901

That would not be evolution, that would be an attempt at creationism. There is no evolution police, and never will be.

danparsonson · 2025-08-30T00:17:50 1756513070

Selection pressure is the evolution police.

card_zero · 2025-08-30T02:03:07 1756519387

It would be fair to compare it to selective breeding, rather than natural selection. The flip side of rejecting usage is promoting neologisms. We can do both things deliberately, I see no rule saying that language is only allowed to evolve naturally. A reasonable criticism would be that trying to change it on purpose makes for a lot of unnecessary fuss, but we can be moderate about it.

soraminazuki · 2025-08-29T17:16:26 1756487786

Diluting the distinction between opt-in and opt-out is gaslighting, not "evolution."

troad · 2025-08-29T23:49:26 1756511366

That seems like an ungenerous and frankly somewhat hysterical take.

By default, you are opted in. Perfectly clear.

The purpose of language is communication, not validating your politics.

soraminazuki · 2025-08-30T01:35:54 1756517754

> By default, you are opted in. Perfectly clear.

That's called opt-out. You're doing exactly what I described: gaslighting people into believing that opt-in and opt-out are synonymous, rendering the entire concept meaningless. The audacity of you labeling people as "political" while resorting to such Orwellian manipulation is astounding. How can you lecture others about the purpose of languages with a straight face when you're redefining terms to make it impossible for people to express a concept?

These are examples of what "opt-in by default" actually means. It means having the user manually consent to something every time, the polar opposite your definition.

- https://arstechnica.com/gadgets/2024/06/report-new-apple-int...

- https://github.com/rom1504/img2dataset/issues/293

It's also just pure laziness to label me as "hysterical" when PR departments of companies like Google have, like you, misused the terms opt-out and opt-in in deceptive ways.

https://news.ycombinator.com/item?id=37314981

Nevermark · 2025-08-30T07:22:34 1756538554

I completely agree with you from a correctness standpoint, ...

> Diluting the distinction between opt-in and opt-out is gaslighting

> That seems like an ungenerous and frankly somewhat hysterical take.

... however, this comment was a reasonable response.

Projective framing demonstrates your own lack of concern for accuracy, clarity or conviviality, that is 180 degrees at odds with the point you are making and the site you are making it on.

benterix · 2025-08-30T10:33:32 1756550012

I can somehow understand the parent. If you control the language, you control the discourse. This is like the famous "I'm appalled at the negativity here on HN" comment threads when doing product launches etc. Or using euphemisms to avoid calling spade a spade.[0] People are fed up with these tricks, hence these emotional reactions.

[0] https://news.ycombinator.com/item?id=26346688

tln · 2025-08-29T15:53:26 1756482806

> By definition an “opt out” setting is selected by default.

No, (IMO) an "opt out" setting / status is assumed/enabled without asking.

So, I think this is opt-in, until Sept 28.

Opt-in, whether pre-checked/pre-ticked or not, means the business asks you.

GDPR requires "affirmative, opt-in consent", perhaps we use that term to mean an opt-in, not pre-ticked.

whilenot-dev · 2025-08-29T16:30:34 1756485034

Regardless whether it's opt-in or opt-out, the business will need to confirm anything it opted for you by asking. If you don't select the opposing choice in a timely fashion, then the business assumes that it opted correctly in your interest and on your behalf.

> So, I think this is opt-in, until Sept 28.

If the business opted for consent, then you will effectively have the choice for refusal, a.k.a. opt-out.

I_am_tiberius · 2025-08-29T12:03:28 1756469008

"five year retention". If it's in a model once, it's there forever.

whimsicalism · 2025-08-29T13:41:55 1756474915

yes, it’s a very big loophole. and if it’s a generative model, you can just launder the data through synthetic generation/distillation to future models

Hnrobert42 · 2025-08-29T12:12:33 1756469553

Is that true? Do models get rebuilt from scratch each time or do they get iterated on?

I_am_tiberius · 2025-08-29T12:15:09 1756469709

I believe the big models currently get built from scratch (with random starting weights). That wasn't my point though. I meant a model created once, might be used for a very long time. Maybe they even release the weights at one point ("open source").

disconcision · 2025-08-29T14:49:05 1756478945

this is somewhat true but i'm not sure how load bearing it is. for one, i think it's going to be a while until 'we asked the model what bob said' is as admissible as the result of a database query

JohnnyMarcone · 2025-08-29T12:03:11 1756468991

I got a pop-up when I opened the app explaining the change and an option to opt out. That seems very transparent to me.

elashri · 2025-08-29T12:19:21 1756469961

> That seems very transparent to me

Implicit consent is not transparent and should be illegal in all situations. I can't tell you that unless you opt out, You have agreed to let me rent you apartment.

You can say analogy is not straightforward comparable but the overall idea is the same. If we enter a contract for me to fix your broken windows, I cannot extend it to do anything else in the house I see fit with Implicit consent.

handoflixue · 2025-08-30T10:01:50 1756548110

How is it "implicit" to click "I agree" to a large pop-up that takes up most of the screen?

danaris · 2025-08-30T13:04:49 1756559089

Courts in various jurisdictions have found clickwrap agreements to be generally only valid for what one would expect to be common provisions within such agreements.

Essentially, because they are presented in a form that is so easy to bypass and so very common in our modern online life, provisions that give up too much to the service provider or would be too unusual or unexpected to find in such an agreement are unenforceable.

mystraline · 2025-08-29T17:51:11 1756489871

As a real world counterexample, medical in the USA does this shit all the time.

Local office will do a blood draw, send it to a 3rd party analysis which isn't covered by insurance, then bill you full. And you had NO contractual relationship with the testing company.

Same scam. And its all because our government is completely captured by companies and oligopoly. Our government hasn't represented the people in a long time.

cube00 · 2025-08-29T12:07:02 1756469222

> That seems very transparent to me.

Grabbing users during start up with the less privacy focused option preselected isn't being "very transparent"

They could have forced the user to make a choice or defaulted to not training on their content but they instead they just can't help themselves.

felideon · 2025-08-29T13:18:58 1756473538

> seems very transparent

Except not:

> The interface design has drawn criticism from privacy advocates, as the large black "Accept" button is prominently displayed while the opt-out toggle appears in smaller text beneath. The toggle defaults to "On," meaning users who quickly click "Accept" without reading the details will automatically consent to data training.

Definitely happened to me as it was late/lazy.

ornornor · 2025-08-29T14:13:15 1756476795

It’s not. And also whether you move the toggle to on or off, you still have to click accept which really isn’t clear whether you’re accepting to share your data or not.

Never mind the complete 180 on privacy.

oblio · 2025-08-29T12:52:41 1756471961

Opt-in leads to very low adoption and is the moral choice.

Opt-out leads to very high adoption and is the immoral choice.

Guess which one companies adopt when not forced through legislation?

insane_dreamer · 2025-08-29T13:56:43 1756475803

It should be off be default, with the option to opt in.

DrillShopper · 2025-08-29T12:52:06 1756471926

It should be opt-in, not opt-out.

The fact that there's no law mandating opt-in only for data retention consent (or any anti-consumer "feature") is maddening at times

Joker_vD · 2025-08-29T12:26:12 1756470372

> You can opt out

You can say that you want to opt out. What Anthropic will decide to do with your declaration is a different question.

AlexandrB · 2025-08-29T13:48:56 1756475336

I look forward to this setting getting turned on again "accidentally" when new models are released or the ToS is updated.

monegator · 2025-08-29T12:23:50 1756470230

I'm super duper sure that my data won't be stored and eventually used if i opt out

jmward01 · 2025-08-29T22:37:03 1756507023

The 5 year is the real kicker. Over the next 5 years I find it doubtful that they won't keep modifying their TOS and presenting that opt out 'option' so that all it will take is one accidental click and they have all your data from the start. Also, what is to stop them from removing the opt out? Nothing says they have to give that option. 4 years and 364 days from now TOS change with no opt out and a retention increase to 10 years. By then the privacy decline will have already have been so huge nobody will even notice that this 'option' was never even real.

episteme · 2025-08-29T12:07:15 1756469235

What will you use instead? I’m finding Claude the best experience since ChatGPT 5 is so slow and not any better answers than 4.

teekert · 2025-08-29T12:14:56 1756469696

Granted, it is a stretch and not near the features of Claude (no code etc), but at least Proton's Lumo [0] is very privacy oriented.

I have to admit, I've used it a bit over the last days and still reactivated my Claude pro subscription today so... Let's say it's ok for casual stuff? Also useful for casual coding questions. So if you care about it, it's an option.

[0] https://lumo.proton.me/

soiltype · 2025-08-29T15:04:50 1756479890

Since I don't use LLMs to directly code for me, I'm going to (mis?)place my trust in Kagi assistant entirely for the time being. It claims not to associate prompts with individual accounts. Small friction of keeping a browser tab open is worth it for me for now.

nocommandline · 2025-08-29T16:54:15 1756486455

If you aren't using it for coding or advanced uses like video, etc, you can try running models locally on your machine using Ollama and others like it.

Self plug here - If you aren't technical and still want to run models locally, you can try our App [1]

1] https://ai.nocommandline.com

weregiraffe · 2025-08-29T14:35:42 1756478142

>What will you use instead? I’m finding Claude the best experience since ChatGPT 5 is so slow and not any better answers than 4.

You could try programming with your own brain

javierluraschi · 2025-08-29T12:42:50 1756471370

https://grok.com

ehnto · 2025-08-29T12:53:28 1756472008

From the frypan into the fire. I think the reality, proven by history and even just this short five years, is no company will hold onto their ethics in this space. This should surprise no one since the first step of the enterprise is hoovering up the worlds data without permission.

Arubis · 2025-08-29T12:56:37 1756472197

Worse by every measure.

weberer · 2025-08-29T15:10:19 1756480219

What metrics are you looking at? Grok 4 outperforms Claude 4 Opus in the Artificial Analysis Intelligence Index.

https://artificialanalysis.ai/leaderboards/models

mac-attack · 2025-08-29T13:24:07 1756473847

What sane person would downgrade to Grok

javcasas · 2025-08-29T12:46:03 1756471563

You can request your data to not be used. Your request will appropriately be read and redirected to /dev/null.

darepublic · 2025-08-29T17:00:06 1756486806

it's almost like this multi billion dollar company is misanthropic, despite their platitudes. Should I not hold my breath on Anthropic helping facilitate "an era of AI abundance for all"? (To quote a rejected PR applicant to Anthropic from the front page)

smallerfish · 2025-08-29T12:11:28 1756469488

Settings > Privacy > Privacy Settings

kossTKR · 2025-08-29T12:14:18 1756469658

i don't see any setting related to this? just:

Export data

Shared chats

Location metadata

Review and update terms and conditions

I'm in the EU, maybe that's helping me?

croes · 2025-08-29T12:20:42 1756470042

Have you clicked "Review and update terms and conditions"?

It's part of the update

kossTKR · 2025-08-29T12:23:21 1756470201

Oh i see thanks. That's a dark design pattern, hiding stuff like that.

No one cares about anything else but they have lots of superflous text and they are calling it "help us get better", blah blah, it's "help us earn more money and potentially sell or leak your extremely private info", so they are lying.

Considering cancelling my subscription right this moment.

I hope EU at leat considers banning or extreme-fining companies trying to retroactively use peoples extremely private data like this, it's completely over the line.

klabb3 · 2025-08-29T13:00:02 1756472402

EU or not, it baffled me that people don't see this glaring conflict of interest. AI companies both produce the model and rent out inference. In other words, you're expecting that the company that (a) desperately crave your data the most and (b) that also happen to collect large amounts of high quality data from you will simply not use it. It's like asking a child to keep your candy safe.

I'd love to live in a society where laws could effectively regulate these things. I would also like a Pony.

croes · 2025-08-29T14:30:02 1756477802

>It's like asking a child to keep your candy safe

That's why we don't hand billions of dollars to a child. Maybe we should treat AI companies similar.

kossTKR · 2025-08-29T13:16:55 1756473415

This is why we need actual regulation, and not the semi fascist monopolist corporatocracy we've evolved into now.

Its only utopian because it's become so incredibly bad.

We shouldn't expect less, we shouldn't push guilt or responsibility onto the consumer we should push for more, unless you actively want your neighbour, you mom, and 95% of the population to be in constant trouble with absolutely everything from tech to food safety, chemicals or healthcare - most people aren't rich engineers like on this forum and i don't want to research for 5 hours every time i buy something because some absolute psychopaths have removed all regulation and sensible defaults so someone can party on a yacht.

frm88 · 2025-08-29T14:45:10 1756478710

Bravo! This has to be the most coherent and well-formulated rant I have read in a longtime. Thank you!

kordlessagain · 2025-08-29T12:00:10 1756468810

> It was the kick in the pants I needed to cancel my subscription.

As if barely two 9s of uptime wasn't enough.

speckx · 2025-08-30T12:55:17 1756558517

I cancelled my subscription as well because of the opt-in by default.

ethagnawl · 2025-08-29T14:43:20 1756478600

I wonder what happens if I don't accept the new T&C? I've been successfully dismissing an updated T&C prompt in a popular group messaging application for years -- I lack the time and legal acumen to process it -- without issue.

Also, for others who want to opt-out, the toggle is in the T&C modal itself.

layer8 · 2025-08-29T15:04:48 1756479888

The new privacy policy automatically becomes effective on September 28, if you don’t already agree to it before. Anthropic states that “After September 28, you’ll need to make your selection on the model training setting in order to continue using Claude.”

nicce · 2025-08-29T15:00:45 1756479645

I tried to do that with WhatsApp and it eventually stopped working.

energy123 · 2025-08-29T14:57:12 1756479432

Has anyone asked why OpenAI has two very separate opt-out mechanisms (one in settings, the other via a formal request that you need to lodge via their privacy or platform page)? That always seemed likely to me to be hiding a technicality that allows them to train on some forms of user data.

demarq · 2025-08-29T12:05:03 1756469103

Are you sure the opt out isn’t only training? The retention does not seem affected by the toggle.

jasona123 · 2025-08-29T12:12:58 1756469578

From the PR update: https://www.anthropic.com/news/updates-to-our-consumer-terms

“If you do not choose to provide your data for model training, you’ll continue with our existing 30-day data retention period.“

From the support page: https://privacy.anthropic.com/en/articles/10023548-how-long-...

“If you choose not to allow us to use your chats and coding sessions to improve Claude, your chats will be retained in our back-end storage systems for up to 30 days.”

zenmaster10665 · 2025-08-29T12:32:17 1756470737

it seems really badly designed or maybe it is meant to be confusing. It does not make it clear that the two are linked together, and you have to "accept" the both together even though there is only a toggle on the "help us make the model better" item.

nicce · 2025-08-29T14:57:47 1756479467

OpenAIs temporary chat still advertises that chats are stored for 30 days while there is court order that everything must be retained indefinitely. I wonder why they are not obligated to state this quite extreme retention.

perihelions · 2025-08-29T12:00:08 1756468808

What are you replacing it with?

troad · 2025-08-29T12:12:15 1756469535

Two weeks left in the sub to figure it out, but I'm not yet sure. I was never all in on all the tooling, I mostly used it as smart search (e.g. ImageMagick incantations) and for trivial scripting that I couldn't be bothered writing myself, so I might just stick to whatever comes with Kagi, see if that doesn't cover me.

perihelions · 2025-08-29T12:17:04 1756469824

How does Kagi (claim that they) enforce privacy rights on the major LLM providers? Have they negotiated a special contract?

I'm looking at

> "When you use the Assistant by Kagi, your data is never used to train AI models (not by us or by the LLM providers), and no account information is shared with the LLM providers. By default, threads are deleted after 24 hours of inactivity. This behavior can be adjusted in the settings."

https://help.kagi.com/kagi/ai/assistant.html#privacy

And trying to reconcile those claims with the instant thread. Anthropic is listed as one of their back-end providers. Is that data retained for five years on Anthropic's end, or 24 hours? Is that data used for training Anthropic models, or has Anthropic agreed in writing not to, for Kagi clients?

FergusArgyll · 2025-08-29T12:40:13 1756471213

They are using llm's through the API where it's the b2b world and you can get privacy

vinnyorvinny · 2025-08-29T12:20:20 1756470020

There is an option to opt out right? So I assume they just make sure to always opt out.

fnordlord · 2025-08-29T12:33:30 1756470810

I'm mostly replying because I was truly using it for an ImageMagick incantation yesterday. I use the API rather than chat, if that's an option for you. I put $20 into it every few months and it mostly does what I need. I'm using Raycast for quick and dirty questions and AnythingLLM for longer conversations.

ivape · 2025-08-29T12:03:37 1756469017

I like think using OpenRouter is better, but there’s absolutely no guarantee from any of the individual providers with respect to privacy and no logging.

Syzygies · 2025-08-29T13:17:10 1756473430

Claude assists me in my math research.

The scenario that concerns me is that Claude learns unpublished research ideas from me as we chat and code. Claude then suggests these same ideas to someone else, who legitimately believes this is now their work.

Clearly commercial accounts use AI to assist in developing intellectual product, and privacy is mandatory. The same can apply to individuals.

vdfs · 2025-08-29T13:42:47 1756474967

> Claude assists me in my math research.

> Claude then suggests these same ideas to someone else, who legitimately believes this is now their work.

Won't this mean that claude assisted you with someone else work? Sure it's not from a "chat" but claude doesn't really know anything other than it's training data

kmacdough · 2025-08-29T17:26:15 1756488375

If you have an idea and are putting it together, you might use Claude for a few things:

- Search the web for related ideas. This could help if someone's already had the idea or if there are things to learn from related ideas. - Review your writeup or proofs for mistakes and clarity

None of these things make the idea Claude's. Claude merely helped with some of the legwork.

But Claude now has your idea in clear, plain text to train on. The next time someone hits on even a similar idea, Claude might well suggest your idea outright. Not seeing your idea published, the user has no way to know it isn't a novel idea. If the person is less diligent/thorough, they may well publish first and claim it as there own, without any nefarious intent.

simpaticoder · 2025-08-29T14:15:40 1756476940

There is a stark difference between using the public web to do research and searching through your colleagues' private notebooks and discussions to do research.

iaw · 2025-08-29T14:12:04 1756476724

> claude doesn't really know anything other than it's training data

I've seen cases where Claude demonstrates novel behaviors or combines existing concepts in new ways based on my input. I don't think it's as simple as memorization anymore.

ffsm8 · 2025-08-29T14:42:40 1756478560

If I am standing in Finland and look out on the ocean, and the whole sky is green... Is the sky actually green?

You're equating your own perspective as objective truth, which is a very common pitfall and fallacy

Syzygies · 2025-08-29T15:39:25 1756481965

To clarify, I see AI as an association engine of immense scope. Others are responding with variations on this model in mind.

It has long been a problem in math research to distinguish between "no one has had this idea" and "one person has had this idea". This used to take months. With the internet and MathSciNet, ArXiv online it took many iterations of guessing keywords. Now, I've spent six months learning how to coax rare responses from AI. That's not everyone's use case.

What complicates this is AI's ability to generalize. My best paper, we imagined we were expressing in print what everyone was thinking, when we were in fact connecting the dots on an idea that was latent. This is an interesting paradox: People see you as most original when you're least original, but you're helping them think.

With the right prompts AI can also "connect the dots".

Aurornis · 2025-08-29T13:18:25 1756473505

When you get the pop-up about the new terms, select the “opt out” option. Then your chats will not be used for training.

Klonoar · 2025-08-29T14:07:23 1756476443

Well, theoretically they won’t.

Anyone who’s worked in an engineering team is familiar with someone forgetting to check ‘if(doNotDoThisCondition)’.

This is why (among many other reasons) opt-in is more user respecting here than opt-out.

SoftTalker · 2025-08-29T14:15:35 1756476935

Forgetting. Riiighht.

Deegy · 2025-08-29T14:15:03 1756476903

If your work was truly novel, wouldn't the odds of it showing up in later models be extremely low given that these are probabilistic?

In a sense these machines are outputting the aggregate of the collective thoughts of the commons. In order for concepts to be output they have to be quite common in the training data. Which works out kind of nice for privacy and innovation because by the time concepts are common enough to show up through inference they probably deserve to be part of the public knowledge (IP aside).

bluecalm · 2025-08-29T14:28:10 1756477690

They might optimize learning to weight novel/unexpected parts more in the future. The better the models become (the more the expect) the more value they will get from unexpected/new ideas.

Deegy · 2025-08-29T14:35:37 1756478137

Good point. But can the models even behave that way? They depend on probability. If they put a greater weight on novel/unexpected outputs don't they just become undependable hallucination machines? Despite what some people think, these models can't reason about a concept to determine it's validity. They depend on recurring data in training to determine what might be true.

That said, it would be interesting to see a model tuned that way. It could be marketed as a 'creativity model' where the user understands there will be a lot of junk hallucination and that it's up to them to reason whether a concept has validity or not.

ceroxylon · 2025-08-29T14:51:37 1756479097

Temperature plays a large role in fine tuning model output, you're correct that there is a theoretical sweet spot:

https://towardsdatascience.com/a-comprehensive-guide-to-llm-...

bluecalm · 2025-08-30T10:00:44 1756548044

I think it's happening already. Chat GPT was able to connect my name to my project based on chess.com profile and one Hacker News post for example. It's not that hard to imagine that it learns a solution to a rare problem based on one input point. It may see one solution 1000 times an a rare solution 1 time and it can still be able to reference both.

thisOtterBeGood · 2025-08-29T13:51:10 1756475470

This perfectly describes one of the biggest dillema with AI. Where does an AI company stop to utilize human knowledge it does not actually own. Where do they draw the line. Apparently it's possible there aren't any lines drawn at all.

sneak · 2025-08-29T13:55:43 1756475743

You can’t own knowledge. Intellectual property is a legal fiction invented to prop up industries.

You can no more own knowledge or information than you can own the number 2.

AvAn12 · 2025-08-29T20:18:45 1756498725

So Anthropic should have no property rights to its own source code?

wolvesechoes · 2025-08-29T15:16:42 1756480602

Property itself is a legal fiction. Every other right you enjoy is a legal fiction.

So what?

sneak · 2025-08-29T17:47:28 1756489648

> Property itself is a legal fiction.

Maybe real property (which only exists because of a property record held in a government building), but it is self-evident to me (and, I believe, most people) that personal property is a natural right.

One only need look up some TikTok videos of Americans getting pickpocketed in Europe to see how large groups of people feel on the matter.

mitthrowaway2 · 2025-08-29T18:45:05 1756493105

But you won't feel the same way about a pickpocket who borrows the source code to the software you derive your livelihood from, your sales team's customer list, your would-be-bestselling novel manuscript, your company's secret formula for a rust-proof coating, or that scientific paper that you and your grad students have spent all summer getting ready to submit for publication?

Thank you for your generosity!

dns_snek · 2025-08-29T15:46:34 1756482394

If you talk to a human they're free to discuss your ideas with someone else. Why should LLMs be any different? The likelihood of these models reproducing your ideas word for word is essentially zero anyway.

More to the point, respecting your wishes to keep those conversations confidential would risk stifling human progress, so they have to be disregarded for the greater good.

const_cast · 2025-08-30T01:35:20 1756517720

> Why should LLMs be any different?

Because they're a computer program and not a human and humans are special.

Why are humans special? Because we're humans and we make the rules.

Its as inane as saying "why can I eat a burger but I can't chop up my friend and eat him? Why is that any different?"

dmbche · 2025-08-29T15:53:22 1756482802

Love to see people being directly and fully against the concept of "confidentiality"

dns_snek · 2025-08-29T16:41:16 1756485676

Not in the slightest! The only thing I'm against is hypocrisy.

LLM enthusiasts are staunch defenders of the argument that use of everyone's ideas and labour in LLM training isn't just fair use, but a moral imperative in order to advance science, art, and human progress as a whole.

It's beyond hypocritical for beneficiaries of this paradigm to then turn around and expect special treatment by demanding that "their" ideas, "their" knowledge, "their" labour be excluded from this process.

dmbche · 2025-08-29T16:50:23 1756486223

Gotcha - right with you. Gotta get my sarcasm detector checked.

andrewmcwatters · 2025-08-29T13:49:06 1756475346

A lot of people doing cat-and-mouse threat detection development are keeping their work outside of public LLMs right now, so it sounds like you’re in the same boat as a lot of us.

Ardren · 2025-08-29T14:00:06 1756476006

> Claude assists me in my math research.

Pulling up the ladder behind you :-)

notrealyme123 · 2025-08-29T15:56:45 1756483005

Unpublished work Vs. Published.

bluecalm · 2025-08-29T14:14:25 1756476865

Math research or anything new/clever in a particular niche. Imagine you optimized a piece of code to get an advantage or came up with some clever trick to solve a common problem in your niche and then everyone gets it from free from Claude believing, as you pointed out, that it's now their work.

I had this exact conversation with my business partner a few days ago. Our "secret sauce" might not be worth that much after many years but still I am not comfortable exposing it to Claude. Fortunately it's very easy to separate in our project so Claude gets the other parts and is very helpful.

JCM9 · 2025-08-29T12:15:43 1756469743

Not a surprise. All the major players have reached the limits of training on existing data—they’re already training on essentially the whole internet plus a bunch of content they allegedly stole (hence various lawsuits). There haven’t been any major breakthroughs in model architecture from the major players recently and thus they’re now in a battle for more data to train on. They need data, and they want YOUR data, now, and are gonna do increasingly shady things to get it.

klabb3 · 2025-08-29T13:10:20 1756473020

> They need data, and they want YOUR data, now, and are gonna do increasingly shady things to get it.

But unlike the 100s of data brokers that also want your data, they have an existing operational funnel of your data already that you voluntary give them every day. All they need is dark pattern ToS changes and manage the minor PR issue. People will forget about this in a week.

threetonesun · 2025-08-29T14:17:40 1756477060

Seems hard to believe legal teams at corporations are going to forget this in a week. I've always assumed the market play for these companies was spinning off an "Amazon basics" version of other companies software, this seems like another step towards that.

cube00 · 2025-08-29T13:12:51 1756473171

It's nice to see the newer models are suffering after being exposed to training on their own slop.

If they had done this in a more measured way they might have been able to separate human from AI content such as doing legal deals with publishers.

However they couldn't wait to just take it all to be first and now the well is poisoned for everyone.

theshackleford · 2025-08-29T13:57:15 1756475835

> It's nice to see the newer models are suffering after being exposed to training on their own slop.

I've seen zero evidence anything of the such is occurring, and that if it was, it's due to what you claim. I'd be highly interested in research suggesting both or either is occurring however.

cube00 · 2025-08-29T17:43:57 1756489437

"AI models collapse when trained on recursively generated data"

https://news.ycombinator.com/item?id=41058194

theshackleford · 2025-08-29T23:06:01 1756508761

That's not what I asked for as it's not relevant.

The claim was made that the models are "suffering", at this exact moment, because they have been recursively feeding themselves, RIGHT now.

I want evidence the current models are "suffering" right now, and I want further evidence that suggests this suffering is due to recursive data ingestion.

Some year old article with no relevance to today talking about hypotheticals of indiscriminate gorging of recursive data is not evidence of either of the things I asked for.

freejazz · 2025-08-29T14:40:49 1756478449

It's not alleged that they stole the content. They told the courts they pirated the materials.

whamlastxmas · 2025-08-29T16:32:06 1756485126

Infringement, not theft :)

freejazz · 2025-08-29T21:24:43 1756502683

Reread my post.

imiric · 2025-08-29T14:44:05 1756478645

Yeah, this is hardly surprising.

To AI companies, data is even more of a gold mine than to adtech companies. It is existentially important.

The truly evil behavior will emerge at the intersection of these two industries. I'm sure Google and Facebook are already using data from one to power the other, even if it's currently behind closed doors. I can hardly wait for the use cases these geniuses will think of once this is publicly acceptable and in widespread use by all companies.

xyst · 2025-08-29T13:14:45 1756473285

Further proof why guardrails/regulation is needed.

superposeur · 2025-08-29T14:16:55 1756477015

Everyone seems to be unsurprised by this move, but I’m genuinely shocked. What a shoot your own foot business decision. Google, evil though it be, doesn’t post the text of your gmails in its search results because who would consider using Gmail after that? This is the llm equivalent. Am I missing something?

rs186 · 2025-08-29T14:35:09 1756478109

Gmail used to serve ads based on your emails for many years until 2017. https://www.npr.org/sections/thetwo-way/2017/06/26/534451513...

skylurk · 2025-08-29T14:41:39 1756478499

And in 2010 they made https the default. Different times :)

j4hdufd8 · 2025-08-29T14:51:26 1756479086

I don't think https is responsible for that. Google owns the data, it doesn't matter how it is transported. It does, however, matter how it is stored (which I hope is encrypted in a way only you can retrieve it)

aleph_minus_one · 2025-08-29T14:32:03 1756477923

> Am I missing something?

I think you do:

According to the article https://www.perplexity.ai/page/anthropic-reverses-privacy-st...

"Enterprise and educational customers will continue operating under their existing privacy protections, as the policy changes specifically exclude Claude for Work and Claude for Education services. These commercial accounts remain governed by separate contractual agreements that maintain stricter data handling standards.

Organizations using Claude through business partnerships or educational licenses can continue their operations without concern for the new training policies affecting their sensitive communications or proprietary information."

Thus, I think your claim

> What a shoot your own foot business decision.

likely does not hold: the non-commercial accounts likely led to Anthropic loosing money, so they are not liked by Anthropic anyway (but are a an "inconvenient necessity" to get people to notice and try out your product offering). With this new decision, Anthropic makes this "free-riding" less attractive.

I bet that Anthropic will soon release a press statement (that exists in the drawers for quite a long time) "We are listening to your concerns, and will thus extend our 'privacy-conscious offering' to new groups of customers. Only 30 $ per month."

ceroxylon · 2025-08-29T14:45:40 1756478740

> With this new decision, Anthropic makes this "free-riding" less attractive

Certainly not for any users like you and me, it takes two seconds and three clicks to review the new terms and decline chat training. This is more like Anthropic getting easy training from people who are unaware or don't care.

soiltype · 2025-08-29T15:00:08 1756479608

Seems the same thing. They're giving plausible deniability, but knowing they'll still scoop up a worthwhile amount of data/profit from some % of users.

827a · 2025-08-29T15:00:58 1756479658

Gmail is free. It would still be incredibly bad for Gmail to start publishing the content of free users' emails to Google search.

But also, Anthropic has said that this new policy also applies to their Pro ($20/mo) and Max ($200/mo) plans. So its not free versus not free.

layer8 · 2025-08-29T14:45:10 1756478710

Well, it means that LLMs used for business use cases will be trained on input from non-business use cases of non-privacy-conscious users.

KoolKat23 · 2025-08-29T14:27:51 1756477671

This data is useful for reinforcement learning. All the others do it.

And most importantly, you can just opt-out.

behnamoh · 2025-08-29T14:40:18 1756478418

Just because all the others do it doesn’t make it right. Many users chose Anthropic exactly because they were not like the others.

wolvesechoes · 2025-08-29T15:09:34 1756480174

> Many users chose Anthropic exactly because they were not like the others.

Oh the naivety.

Sooner or later they all become the same, soon after "investors" or "shareholders" arrive.

behnamoh · 2025-08-29T15:18:03 1756480683

> Sooner or later they all become the same, soon after "investors" or "shareholders" arrive.

They already arrived. Google was one of the main investors of Anthro.

const_cast · 2025-08-30T01:43:07 1756518187

> Many users chose Anthropic exactly because they were not like the others.

Companies are less like people and more like bacteria. They are programmatic, like algorithms.

What they will do has already been decided for them, programmed into them, by the rules of capitalism. It is inevitable. There are no good guys, and there are no bad guys, there's just... microbes.

Those who do not engage in capitalism, perhaps they do not seek money at all, have no such hard limitations. But they are rare, because money is blood.

KoolKat23 · 2025-08-29T15:13:42 1756480422

There's no reason to be shocked by the practice however.

superposeur · 2025-08-29T14:36:43 1756478203

Ok, to be clear, let’s say I’m dumb and accidentally go with the default (I get the color of the opt out button wrong or something). As if there’s a “publish my private emails to the internet” default-on button in email. Then, I use it to edit a rec letter for student X, with my signature Y. (Yes I know this is dumb and I try changing names when editing but am sure some actual names may slip through.) A few months later the next model is released trained on the data. Student X asks Claude what Y would write in a rec letter about X. Such a button is a “wings stay on / wings fall off” button on a plane.

franga2000 · 2025-08-29T14:48:33 1756478913

You're severely overestimating the ability of the model to recall a single mostly uninteresting item from it's billions of input documents.

turnsout · 2025-08-29T14:29:27 1756477767

You can't opt out of the data retention policy.

smca · 2025-08-29T14:38:53 1756478333

The data retention period is 30 days if you don't choose to improve model training. https://www.anthropic.com/news/updates-to-our-consumer-terms...

turnsout · 2025-08-29T14:44:27 1756478667

Oh, I didn't catch this—that's good news

podgorniy · 2025-08-29T15:24:44 1756481084

What a framing. Like there is exactly a surprise behing all these reactions.

einpoklum · 2025-08-29T14:40:08 1756478408

Google mines the bejeezus out of your email, and uses it to any number of ends, including manipulating you into buying things, and also passing your correspondence on to the US government. While this is not the same as outright making your emails universally searchable - training Claude on your emails is also not the same as posting their contents.

And - this behavior of Google's has not been penalized, I'm afraid.

shadowgovt · 2025-08-29T14:30:27 1756477827

The LLM equivalent is what Google does do, which is train its spam filters on the contents of your emails coupled to the signal of what human beings flag as spam.

(It was one of the first significant value-adds of GMail: at its scale, Google could create a global-concept understanding of the content and pattern of spam across hundreds of millions of users. That was the kind of Big Data that made it possible to build filters where one could confidently say "This is tuned on all spam in the wild, because we've seen all spam in the wild").

lewdwig · 2025-08-29T12:01:22 1756468882

TBH I’m surprised it’s taken them this long to change their mind on this, because I find it incredibly frustrating to know that current gen agentic coding systems are incapable of actually learning anything from their interactions with me - especially when they make the same stupid mistakes over and over.

const_cast · 2025-08-29T13:16:31 1756473391

Okay they're not going to be learning in real time. Its not like you're getting your data stolen and then getting something out of it - you're not. What you're talking about is context.

Data gathered for training still has to be used in training, i.e. a new model that, presumably, takes months to develop and train.

Not to mention your drop-in-the-bucket contribution will have next to no influence in the next model. It won't catch things specific to YOUR workflow, just common stuff across many users.

ethagnawl · 2025-08-29T14:58:35 1756479515

> Not to mention your drop-in-the-bucket contribution will have next to no influence in the next model. It won't catch things specific to YOUR workflow, just common stuff across many users.

I wonder about this. In the future, if I correct Claude when it makes fundamental mistakes about some topic like an exotic programming language, wouldn't those corrections be very valuable? It seems like it should consider the signal to noise ratio in these cases (where there are few external resources for it to mine) to be quite high and factor that in during its next training cycle.

vjerancrnjak · 2025-08-29T12:56:01 1756472161

They wouldn’t be able to learn much from interactions anyway.

Learning metric won’t be you, it will be some global shitty metric that will make the service mediocre with time.

nicce · 2025-08-29T12:10:59 1756469459

Or get more value from the users with the same subscription price. I doubt they are giving any discounts.

diggan · 2025-08-29T12:53:21 1756472001

It's actually pretty clever (albeit shitty/borderline evil), start off by saying you're different by the competitors because you care a lot about privacy and safety, and that's why you're charging higher prices than the rest. Then, once you have a solid user-base, slowly turn on the heat, step-by-step, so you end up with higher prices yet same benefits as the competitors.

Deegy · 2025-08-29T14:23:04 1756477384

I guess I'll take the other side of what most are arguing in this thread.

Isn't it a great thing for to us to collectively allow LLM's to train on past conversations? LLM's probably won't get significantly better without this data.

That said I do recognize the risk of only a handful of companies being responsible for something as important as the collective knowledge of civilization.

Is the long term solution self custody? Organizations or individuals may use and train models locally in order to protect and distribute their learnings internally. Of course costs have to come down a ridiculous amount for this to be feasible.

monsieurbanana · 2025-08-29T14:25:24 1756477524

You mean collectively allow us to train Claude's llm? Pretty big omission there

Deegy · 2025-08-29T14:30:06 1756477806

I believe I addressed that in my third paragraph?

It does suck that there are only a few companies with enough resources to offer these models. But it's hard to escape the power laws.

I'm hoping that costs come down to the point where these things are basically a commodity with thousands of providers.

monsieurbanana · 2025-08-29T16:56:30 1756486590

Save your prompts, anonymize them and offer them to anyone that wants to train a LLM, that is us collectively training LLMs.

Giving Claude your private data ensures that there will not be thousands of providers, since the limiting factor isn't power but data.

int_19h · 2025-08-29T19:10:01 1756494601

It is a great thing if it were reciprocated. But when I'm paying $20/mo to access Claude, why should I give training data to Anthropic for free?

lacoolj · 2025-08-29T16:43:29 1756485809

Proprietary data (your company's app repository, a script for upcoming movie) and sensitive data (health, finance) become exposed

gloosx · 2025-08-30T07:10:37 1756537837

>LLM's probably won't get significantly better without this data.

Who told you LLMs will get significantly better with this data? Sam Altman?

jimbokun · 2025-08-29T14:57:56 1756479476

It's not clear that most people will benefit from LLMs getting significantly better. It's looking more like a net negative.

mitthrowaway2 · 2025-08-29T21:12:24 1756501944

I'm okay with LLMs not getting better.

freejazz · 2025-08-29T14:42:18 1756478538

> LLM's probably won't get significantly better without this data.

Yeah and Facebook couldn't scale without ignoring the harms it causes people. Should we just let that be? Society seems to think so but I don't think it's a good idea at all.

cowpig · 2025-08-29T16:06:10 1756483570

> That said I do recognize the risk of only a handful of companies being responsible for something as important as the collective knowledge of civilization.

It's not just the risk of irresponsible behaviour (which is extremely important in a situation with so much power imbalance)

It's also just the basic properties of monopolistic markets: the smaller the number of producers, the closer the equilibrium price of the good maximizes the producers' economic surplus.

These companies operate for-profit in a market, and so they will naturally trend toward capturing as much value as they can, at the expense of everyone else.

If every business in the world depends on AI, this effectively becomes a tax on all business activity.

This is obviously not in the collective interest.

Of course, this analysis makes simplifying assumptions about the oligopoly. The reality is much worse: the whole system creates an inherent information asymmetry. Try and imagine what the "optimal" pricing strategy is for a product where the producer knows intimate details about every consumer.

ljosifov · 2025-08-29T14:41:52 1756478512

Excellent. What were they waiting for up to now?? I thought they already trained on my data. I assume they train, even hope that they train, even when they say they don't. People that want to be data privacy maximalists - fine, don't use their data. But there are people out there (myself) that are on the opposite end of the spectrum, and we are mostly ignored by the companies. Companies just assume people only ever want to deny them their data.

It annoys me greatly, that I have no tick box on Google to tell them "go and adapt models I use on my Gmail, Photos, Maps etc." I don't want Google to ever be mistaken where I live - I have told them 100 times already.

This idea that "no one wants to share their data" is just assumed, and permeates everything. Like soft-ball interviews that a popular science communicator did with DeepMind folks working in medicine: every question was prefixed by litany of caveats that were all about 1) assumed aversion of people to sharing their data 2) horrors and disasters that are to befall us should we share the data. I have not suffered any horrors. I'm not aware of any major disasters. I'm aware of major advances in medicine in my lifetime. Ultimately the process does involve controlled data collection and experimentation. Looks a good deal to me tbh. I go out of my way to tick all the NHS boxes too, to "use my data as you see fit". It's an uphill struggle. The defaults are always "deny everything". Tick boxes never go away, there is no master checkbox "use any and all of my data and never ask me again" to tick.

AlexandrB · 2025-08-29T15:23:29 1756481009

I think I'd have more understanding for this position if I thought that these companies were still fundamentally interested in serving their users. They are not. Any information you provide is more likely to be used against your interests (even if that's "just" targeting you with some ads for a scammy product) than for your benefit.

Basically all AI companies are fruit from the same VC-poisoned tree and I expect these products will get worse and more user-hostile as they try to monetize. We're currently living in the "MoviePass"[1] era of AI where users are being heavily subsidized to try to gain market share. It will not last and the potential for abuse is enormous.

[1] https://en.wikipedia.org/wiki/MoviePass

ljosifov · 2025-08-29T19:25:10 1756495510

Whether Google is interested in serving me or not, is not only untestable (i.e. what counts as 'Google', 'interested', and 'serving' there - one could argue to end of time) - but besides the point. I want to be able to tell Google "My home is XYZ", and for Google to use that information about me in all of Google ecosystem. When I talk to Gemini it should know what/where "LJ home" is, when I write in Gdoc it should know my home address (so to insert it if I want it), ditto for Gmail, when I search in Google photos "photos taken at home" it should also know what "home" is for me.

Atm Google vaguely knows, and uses that for Ads targeting, sometimes. Most of the time - the targeting is bad, very low quality slop. To the level of "he bought a mattress yesterday, will keep buying mattresses in the next 30-60 days". I have the impression that we ended up in the worst case scenario. People I don't want to have my data, have access to it. People I do want to have my data, are afraid to touch it, and use it - yes! - for theirs, but also for my benefit too. The current predicament seems to me the case of "public lies, private truths."

A small cadre of vocal proponents of a particular view, established "the ground truth to what is desirable". (in this case - maximum privacy, ideally zero information sharing) The public goes with it in words, pays lip service, while in deeds, the revealed preferences show, they value their data privacy very cheaply, almost zero. Even one click extra, to share their data less, is one click too many, effort too high, for most people. Again - these are revealed preferences, for people keep lying when asked. It's not even the case of "you are lying to me" - no, it's more like "you are lying to yourself."

The conventional opinion is that the power imbalance coming from the information imbalance (state/business know a lot about me; I know little about them) is that us citizens and consumers should reduce our "information surface" towards them. And address the imbalance that way. But. There exists another, often unmentioned option. And that option is for state/business to open up, to increase their "information surface" towards us, their citizens/consumers. That will also achieve information (and one hopes power) rebalance. Yes there is extra work on part of state/business to open their data to us. But it's worth it. The more advanced the society, the more coordination it needs to achieve the right cooperation-competition balance in the interactions between ever greater numbers of people. There is an old book "Data For the People" by an early AI pioneer and Amazon CTO Andreas Weigend. Afaics it well describes the world we live in, and also are likely to live even more in the future.

danparsonson · 2025-08-30T00:37:23 1756514243

You started by saying that it's difficult or impossible to define what 'serving the user' looks like, then immediately gave examples of what it would look like to you. It's not that Google can't do these things or is afraid to, but rather that operating in your best interests does not benefit their shareholders. Sure, it'd be great if we could all just get along, but we're living in the worst case scenario you describe because we can't all just get along. Not trusting companies like Google with your personal data is the pragmatic choice; regardless of what they could do with our data, what they actually do with it is what counts.

Side note: they know exactly where you live. My colleague's Android used to tell him, without any prompting or specific configuration, how long his drive home from work would take that day. That was over ten years ago.

ljosifov · 2025-08-30T08:50:48 1756543848

Yes - I meant 'impossible to difficult' to define to all people, at all times. Agree it's easy for me to define how that looks. It doesn't mean that the same is true to you. That's why I went from a very general, to very specific.

I'm saying we ended up in situation where people are lying when they say "I don't trust Google", b/c they have Gmail, use Google services - so their trust can't be zero. It's more than zero. Obviously it's a trade-off, people are pragmatic they do their cost-benefit analysis, and act accordingly. They just lie when they talk about the subject. I think it'd be better for all, if the public discussion moved from "I trust Google zero" (which is obviously untrue), to "There is cost-benefit to this, and I personally chose xyz".

827a · 2025-08-29T14:57:15 1756479435

Its incredible to me how seriously people can hold an opinion they've so clearly critically interrogated so little.

soraminazuki · 2025-08-29T15:57:31 1756483051

It makes sense when you see it as indoctrination than a mere opinion. Quashing critical thinking is the point. How else can you convince people to work against their own interests?

ljosifov · 2025-08-29T16:12:54 1756483974

I put it to you - consider that you maybe wrong. That I indeed know what's best for me. The same way my default is that you know what's best for you. "Critical thinking" and "indoctrination" - you are on path to the dark side there. I grew up in a socialist/communist country. One of the ways in which vast majority of the population were oppressed, mis-treated etc or worse, was by them being denied agency and capability for critical thinking, for recognising their own interests, by a mechanism called "false consciousness". The ideas you expressed in your comment are of similar kind.

soraminazuki · 2025-08-29T16:28:52 1756484932

Says the person advocating for companies to get rid of consent, the bare minimum they can do when screwing over people for profit. That's not deciding what's best for you. That's you unilaterally deciding that no one deserves consumer protection. You are trying to force on everyone what 96% of people are opposed to [1]. So don't you dare pull off that DARVO nonsense and accuse me of being an oppressive dictator.

Also in what universe are utter fantasies like "'no one wants to share their data' is just assumed" or "the defaults are always 'deny everything'" true? Tech companies are bypassing user consent all [2] the [3] time [4].

[1]: https://arstechnica.com/gadgets/2021/05/96-of-us-users-opt-o...

[2]: https://hn.algolia.com/?q=opt%20out

[3]: https://hn.algolia.com/?q=opt%20in

[4]: https://hn.algolia.com/?q=consent

ljosifov · 2025-08-29T18:39:47 1756492787

I see reading comprehension is not something you enjoy to indulge with.

These -

> utter fantasies like "'no one wants to share their data' is just assumed" or "the defaults are always 'deny everything'" true?

...far from being fantasies, are my personal experiences in the UK medical systems. This -

https://news.ycombinator.com/item?id=45066321

soraminazuki · 2025-08-29T20:21:49 1756498909

See, this is what I meant by indoctrination. I showed you links containing dozens, maybe even hundreds of examples showing how companies don't obtain consent from users. But you ignore all that and cherry pick your highly exaggerated spin on the UK medical system. "I'm reasonably sure some people will have died because of this." Sigh, give me a break. Your take on privacy sounds just like the auto industry's take on right to repair. According to them, right to repair laws will get women raped in parking lots [1]. Corporate activists making absurd claims resorting to the same old fearmongering tactics.

This isn't me having problems with reading comprehension. It's you arguing in bad faith. Which is inevitable given your desire to demolish consumer protection for everyone. You're defending the indefensible.

[1]: https://www.vice.com/en/article/auto-industry-tv-ads-claim-r...

ljosifov · 2025-08-29T20:38:39 1756499919

I know indoctrination well. Reading what you write - I get the impression that you don't know much about indoctrination. But I don't know you, so I allow it that I maybe wrong. You asked "in what universe". I showed you concrete examples in one universe. For my claim to be true, one example suffices. None of your claims (latest "demolish customer protection") about my alleged intentions, character, thoughts, etc - have any basis in reality. You are wrong in almost everything that you wrote about me. It's all in your head, in your imagination. How do I know? B/c I know me, and you don't know me. That easy.

soraminazuki · 2025-08-29T20:51:56 1756500716

> Excellent. What were they waiting for up to now?? I thought they already trained on my data. I assume they train, even hope that they train, even when they say they don't.

These are your exact words, not my imagination. You very clearly want consumer protection to be gone, because you said so.

> For my claim to be true, one example suffices.

To be clear, your claim is that we live in a world where there's too much privacy protection. So much in fact that you're, gasp, "reasonably sure some people will have died because of this." Nope, a single spin on the UK medical system is nowhere near as sufficient for that absurd claim.

As for your attempted word lawyering about indoctrination? Classic.

ljosifov · 2025-08-29T21:26:55 1756502815

Yes - my data, not your data. You stay away from my data. I stay away from your data. I don't care about your data. But I do want them to train on my data. And to serve me better. Was disappointed that they didn't do that already.

But now you gave me ideas. ;-) Yeah - I think ideally we should go further, much further. Internet was not built by po-faced, lemon-sucking prudes, tut-tut-ing about everything and anything. It was built by happy-go-lucky, live-and-let live, altruistic mildly autistic nerds. It was permission-less, one didn't need to ask anyone in order to do anything, and that's why it lived. Whereas many other networks and protocols, technically more sophisticated, but with a fatal flaw that a gatekeeper with the power to say "NO" was built into them - just died off. Wish people went back to the original permission-less Net. That people tore down all manner of laws making moving bits around illegal, used to jail humans for crimes of reading, copying and writing data.

igor47 · 2025-08-30T16:24:30 1756571070

You should read "the cuckoos egg", written by a happy go lucky nerd in the 80s dawn of network systems. Already there were bad actors in the system and he fought an uphill battle to implement network security. You're already standing on the shoulders of giants like him who saved the net -- i don't believe it could survive without a robust permission structure.

franga2000 · 2025-08-29T14:52:09 1756479129

Try living in a place with privatised health insurance and you'll quickly see why medical data is some of the most important to keep private.

behnamoh · 2025-08-29T14:55:59 1756479359

Are you trolling us or do you live in a hypothetical world where companies have our best interests in heart?

blipmusic · 2025-08-29T14:49:08 1756478948

My life does in fact have priorities above ”LLMs should work a bit better”.

Gud · 2025-08-29T15:07:03 1756480023

Have you considered the drawbacks of sharing your data to the most unscrupulous people on this planet?

ljosifov · 2025-08-30T07:45:18 1756539918

I already share lots of my data with Google. I have Gmail where a lot of my online life is reflected. I have Photos, Gmaps, Gdrive. Also Google knows about my YouTube viewing, my Android phone use. So no matter what I say - with my actions, my revealed preference is - that I trust Google. So far - Google have not betrayed my trust, afaics. So I actually want for Google to adapt Gemini to me, either via the context, or even with a thin layer of LoRA. If Google treats me like a complete stranger it knows nothing about, then Google, and plenty of other people, make use of my data, but I, the creator (and nominal owner) of my data - don't benefit from their knowledge of me?? That sounds the worst of the possible options to me.

j4hdufd8 · 2025-08-29T14:49:46 1756478986

Are you okay getting ads for shit holistic medication because you had a mental health conversation with AI?

12ian34 · 2025-08-29T14:44:30 1756478670

not remotely worried about leaks, hacks, or sinister usage of your data?

ljosifov · 2025-08-30T08:12:03 1756541523

I'm worried, it's not like I don't care. For example, I'm worried that Google is such a huge ginormous target, that at some point their Gmail will be broken. At the same time, there are benefits to sharing data. There are benefits to me, in Google using the information it has on my, to make my life easier. In this case, I judge that Gemini using my data to train, is a low extra risk for me. Compared to all other risks I take, for doing things in public. Including writing this on public forums, as you do too.

In general, I find the ongoing public scare about sharing data, to be anti-thesis to the original spirit of the Net, that was all about sharing data. Originally, we were delighted to connect to perfect strangers on the other side of the world. That we would never have gotten to communicate with otherwise. I accept there might have been an element of self-selection there, that aided that view: people one'd communicate with, although maybe from a different culture, would be from similar niche sub-culture of people messing with computers and looking forward to communication, having a favourable view of that.

12ian34 · 2025-08-30T09:14:38 1756545278

> the ongoing public scare about sharing data

I think this might be a bit of a social bubble thing - I think it isn't a forefront concern for the vast majority of people.

ljosifov · 2025-08-30T09:39:11 1756546751

I think you are correct there - the majority of the public don't care. They just try to get about doing their daily business and act the best they can under circumstances. So we just click "Accept" to any popup banner make it go away, accept "All cookies" 100 times every day, use Google mail/map/photos/drive and that all involves giving away data, even if in words we say we don't want to give data. So yes obviously the public by necessity act in a rational way, doing cost-benefit analysis. While a cadre of privacy obsessives have made my life worse by lobbying and having their bad ideas codified in the UL laws. Wrote about my experience in the UK medical systems here https://news.ycombinator.com/item?id=45066321

londons_explore · 2025-08-29T14:51:56 1756479116

I would far prefer the service use my data to work better and take a few privacy risks.

People die all the time from cancer or car accidents. People very rarely die from data leaks.

Some countries like Sweden make people's private financial data public information - and yet their people seem happier than ever. Perhaps privacy isn't as important as we think for a good society.

soiltype · 2025-08-29T14:58:03 1756479483

public/private isn't a binary, it's a spectrum. we Americans mostly sit in the shithole middle ground where our data is widely disseminated among private, for-profit actors, for the explicit purpose of being used to manipulate us, but it's mostly not available to us, creating an assymmetric power balance.

ljosifov · 2025-08-29T16:56:47 1756486607

I agree with your stance there. Further - the conventional opinion is that the power imbalance coming from the information imbalance (state/business know a lot about me; I know little about them) is that us citizens and consumers should reduce our "information surface" towards them. And address the imbalance that way. But.

There exists another, often unmentioned option. And that option is for state/business to open up, to increase their "information surface" towards us, their citizens/consumers. That will also achieve information (and one hopes power) rebalance. Every time it's actually measured, how much value we put on our privacy, when we have to weight privacy against convenience and other gains from more data sharing, the revealed preference is close to zero. The revealed preference is that we put the value of our privacy close to zero, despite us forever saying otherwise. (that we value privacy very very much; seems - "it ain't so")

So the option of state/business revealing more data to us citizens/consumers, is actually more realistic. Yes there is extra work on part of state/business to open their data to us. But it's worth it. The more advanced the society, the more coordination it needs to achieve the right cooperation-competition balance in the interactions between ever greater numbers of people.

There is an old book "Data For the People" by an early AI pioneer and Amazon CTO Andreas Weigend. Afaics it well describes the world we live in, and also are likely to live even more in the future.

12ian34 · 2025-08-29T15:02:03 1756479723

Sweden is a very poor example, all that is public is personal taxable income. That's it. You're comparing apples to oranges. And how is your home address, and AI chatbot history going to cure cancer?

nojs · 2025-08-29T16:02:01 1756483321

Would you be comfortable posting all of this information here, right now? Your name, address, email address, search history, ChatGPT history, emails, …

If not, why?

Gud · 2025-08-29T15:08:21 1756480101

That financial data is very limited. Would it be just as acceptable if these companies knew where and what you purchased?

ljosifov · 2025-08-29T16:40:19 1756485619

In the past I have found obstacles to data sharing codified in the UK law frustrating. I'm reasonably sure some people will have died because of this, that would not have died otherwise. If they could communicate with the NHS, similarly (email, whatsapp) to how they communicate in their private and professional lives.

Within the UK NHS and UK private hospital care, these are my personal experiences.

1) Can't email my GP to pass information back-and-forth. GP withholds their email contact, I can't email them e.g. pictures of scans, or lab work reports. In theory they should have those already on their side. In practice they rarely do. The exchange of information goes sms->web link->web form->submit - for one single turn. There will be multiple turns. Most people just give up.

2) MRI scan private hospital made me jump 10 hops before sending me link, so I can download my MRI scans videos and pictures. Most people would have given up. There were several forks in the process where in retrospect could have delayed data DL even more.

3) Blood tests scheduling can't tell me back that scheduled blood test for a date failed. Apparently it's between too much to impossible for them to have my email address on record, and email me back that the test was scheduled, or the scheduling failed. And that I should re-run the process.

4) I would like to volunteer my data to benefit R&D in the NHS. I'm a user of medicinal services. I'm cognisant that all those are helping, but the process of establishing them relied on people unknown to me sharing very sensitive personal information. If it wasn't for those unknown to me people, I would be way worse off. I'd like to do the same, and be able to tell UK NHS "here are, my lab works reports, 100 GB of my DNA paid for by myself, my medical histories - take them all in, use them as you please."

In all cases vague mutterings of "data protection... GDPR..." have been relayed back as "reasons". I take it's mostly B/S. Yes there are obstacles, but the staff could work around if they wanted to. However there is a kernel of truth - it's easier for them to not try to share, it's less work and less risk, so the laws are used as a cover leaf. (in the worst case - an alibi for laziness.)

ljosifov · 2025-08-29T15:02:08 1756479728

If they leaked bank accounts numbers, or private keys - I would be worried. That has not happened in the past.

About myself personally - my Name Surname is googleable, I'm on the open electoral register, so my address is not a secret, my company information is also open in the companies register, I have a a personal website I have put up willingly and share information about myself there. Training models on my data doesn't seem riskier than that.

Yeah, I know I'd be safer if I was completely dark, opaque to the world. I like the openness though. I also think my life has been enriched in infinitely many ways by people sharing parts of their lives via their data with me. So it would be mildly sociopathic of me, if I didn't do similar back to the world, to some extent.