@Codeberg@social.anoxinon.de
Actually since yesterday I'm pondering about the idea to build a #federated version of stackoverflow, nothing written yet, I'm reading, researching.
Also, right now I was checking this stack exchange sqlite db under CC BY-SA 4.0 to check how useful and doable would be import this data and using as a base for the federated version.
Also wondering if we could use this data somehow to train our own opensource AI to help the community, but I'm do not have knowledge on LLM/AI things. Please if there is any expert I would appreciate the opinion on that.
https://seqlite.puny.engineering
EDIT:
A better place to download the dump content, with more interesting tables, like the one with the Votes; the other link the dumped data only contains two tables Users and Posts. Right now downloading the whole data related with stack overflow, and will take some time due my humble home internet connection, so I didn't have the chance to take a look at the data, but I guess that's the interesting thing.
here:
https://archive.org/download/stackexchange
People who are looking to start a SE alternative but start with the idea of importing the original SE data dumps are already Doing It Wrong. Much of the issue that has led to the desire to fork SE comes due to the license of the posts and content, which lacks the NC (NonCommercial) component of Creative Commons. Without that component, any attempt to make a Fediverse alternative just ends up in Yet Another Endpoint that can be freely siphoned for data by corporations, for AIs, etc.
See here’s the thing: Creative Commons is not an exclusionary license. If I want to make commercial use of something that has a CC-NC license, I explicitly can ask the author for a secondary license limited to the usage and scope that I need. The important thing here is that the author still retains control, as well as a data point of who is profiting from their stuff and how.
So if someone wanted to use this for work they would have to have an issue, find an answer, contact a person, and hope they can use the thing they just found to their problem?
Like, who wants that?
Heck I don’t want every person on here who found something I said useful to be hounding me about using my code either.
People who are looking to start a SE alternative but start with the idea of importing the original SE data dumps are already Doing It Wrong. Much of the issue that has led to the desire to fork SE comes due to the license of the posts and content, which lacks the NC (NonCommercial) component of Creative Commons. Without that component, any attempt to make a Fediverse alternative just ends up in Yet Another Endpoint that can be freely siphoned for data by corporations, for AIs, etc.
Or used by people in any commercial product. Is there really enough people to justify a info exchange of just hobby projects?
See here’s the thing: Creative Commons is not an exclusionary license. If I want to make commercial use of something that has a CC-NC license, I explicitly can ask the author for a secondary license limited to the usage and scope that I need. The important thing here is that the author still retains control, as well as a data point of who is profiting from their stuff and how.
So if someone wanted to use this for work they would have to have an issue, find an answer, contact a person, and hope they can use the thing they just found to their problem?
Like, who wants that?
Heck I don’t want every person on here who found something I said useful to be hounding me about using my code either.