deleted by creator
I’m torn between wanting to opt-out because it’s morally correct, or remaining opted-in so I can poison AI models with my terrible code.
so I can poison AI models with my terrible code.
Don’t forget to teach it obscenities and yell at it whenever it fucks something up!
Nah, guarantee the models have rules built in to deal with obvious stuff like that.
You need to be more subtle. Give them information that is slightly wrong.
Perhaps by generating a bunch of complex copilot code to upload. It’s easy to mass produce and would look plausibly functional.
Training AI models on AI content is the fastest route to model collapse.
I signed up to github purely to opt in and upload terrible python code.
If they desperately want to train the idiot machine on my awful self-taught code, that’s on them.
Name all your variables poorly and with swear words
You’re using copilot??
No, you don’t have to use it for it to take your code for training.
Yeah all you have to do is commit anything to GitHub
They’re scraping all the code regardless of your preferences. I guarantee it.
All open source software is being scraped, on github or not!
Opted Out and moved all to codeberg
How is codeberg?
i love codeberg, though i haven’t had a chance to test the collaboration features all that much
Has everything I need, but not more
Link for opting out: https://github.com/settings/copilot/features
In the “Privacy” section, set “Allow GitHub to use my data for AI model training” to “Disabled”.
Not to be too snarky, but was there ever an assumption that stuff you put in wasn’t being used to train it? Safe to assume that any online service you’re using is making use of the data you’re giving it.
If you’re a business with a contract with them it should state that they won’t use your data to train their models.
If you’re using the free service then you’re right that it’s safe to assume that your data was already being used.
business with a contract
I always wonder at this and have cautioned my managers repeatedly. Yes, we have a contract, but they have a literal army of lawyers and we have less (one lawyer one retainer for hourly work or a small grouping focused on taxes and employment law). As if our ownership won’t bend over backwards to avoid suing a large company like Google, AWS, Microsoft, or Oracle. (Maybe OpenAI and Anthropic are sue-able by a $100 million corp?)
As proof I offer the lawsuits between businesses that have proceeded far enough the general public has heard about them. Not a specific one, just all of them.
You have to trust the contract.
If you use Microsoft 365 or Google Workspace etc then they already have all your data anyway. Most businesses have to trust other companies and the contract at some point.
The only other option is to use Open Source self hosted everything which is beyond most people’s ability.
fun fact, if you’ve ever accidentally clicked the “enable” button on copilot because you’re a dumbass who can’t read, you get a shitton of more settings, most of which are locked to “enabled”.
Even more fun fact, if you never clicked the “enable” button on Copilot, most of those settings are locked to “enabled” anyway.
yeah you just can’t see them. fun!
Yes I just found that this morning. Time to seriously look at the GitHub alternatives.
Also another setting under CoPilot>Coding Agent - turn off for All Repositories - mine was set to On.
…and they want to train the idiot machine on this dumbass’ terrible self-taught python code.
Got this email last night and felt validated for never uploading any code to GitHub because I don’t trust Microsoft. lol I don’t have any big coding projects, but I self-host a ForgeJo server in my mini rack at home behind a Twingate VPN.
FYI: it is not “ForgeJo”
Forgejo is derived from Esperanto where the “ejo” suffix means “place”. The J is pronounced like y is in English.
It’s “forge-ejo” not “forge-joe”
If you’re still om github, you’re kinda doing for it.






