John Colagioia

John Colagioia@lemmy.sdf.org · 2 months ago

I can’t vouch for anything about it, since I’ve never done more than look and bookmark the page, but Vidzy at least exists and has an instance that plays one short video…

John Colagioia@lemmy.sdf.org · 2 months ago

The Indie Web website up there actually has protocols to do most of what people do for social media, in exactly that structure. It’s enough of a pain to set up that I don’t see it becoming normal, but the amount that I’ve set up for my website at least works…

John Colagioia@lemmy.sdf.org · 6 months ago

I only just learned about this, so I haven’t signed up or checked out the communities and therefore won’t endorse it, but Codidact just came across my desk. https://codidact.com/

John Colagioia@lemmy.sdf.org · edit-2 1 year ago

I keep saying “no” to this sort of thing, for a variety of reasons.

“You can use this code for anything you want as long as you don’t work in a field that I don’t like” is pretty much the opposite of the spirit of the GPL.
The enormous companies slurping up all content available on the Internet do not care about copyright. The GPL already forbids adapting and redistributing code without licensing under the GPL, and they’re not doing that. So another clause that says “hey, if you’re training an AI, leave me out” is wasted text that nobody is going to read.
Making “AI” an issue instead of “big corporate abuse” means that academics and hobbyists can’t legally train a language model on your code, even if they would otherwise comply with the license.
The FSF has never cared about anything unless Stallman personally cared about it on his personal computer, and they’ve recently proven that he matters to them more than the community, so we probably shouldn’t ever expect a new GPL.
Because it’s been based on one person’s personal focuses, the GPL has so many problems that the FSF either doesn’t care about or isolates in random silos (like the AGPL, as if the web is still a fringe thing) that AI barely seems relevant.

I mean, I get it. The language-model people are exhausting, and their disinterest in copyright law is unpleasant. But asking an organization that doesn’t care to add restrictions to a license that the companies don’t read isn’t going to solve the problem.

John Colagioia@lemmy.sdf.org · edit-2 1 year ago

In addition to YaCy and the varieties of Searx (both of which perform better for me than any of the commercial search engines), it’s not even out of the question to do this yourself, if you’re willing to start with the most recent Common Crawl dump and do some spidering in between releases. I don’t recommend it, unless you want to learn for yourself why search engines often give such miserable results, but it’s possible.

However, that’s the issue, here. Can you self-host a search engine? Sure, if you want to maintain the storage to back it. That depends on how deep your pockets go…

John Colagioia@lemmy.sdf.org · 1 year ago

Probably, though I don’t know their architecture well enough to say. The discussion that I saw referred specifically to PDF.js, which I believe is what the browsers use.

John Colagioia@lemmy.sdf.org · 1 year ago

It’s not as clean a solution as they’d like it to be, but for another option, Jellyfin hosts media including books. When I say “not as clean,” I mean that you can stream video and music from the server, but it has you download books to read on another device. Last I heard, they were looking to integrate at least a PDF viewer into the interface, though.

John Colagioia@lemmy.sdf.org · 1 year ago

My half-solution to this has always been to refer to where I’m working in my notes, like a file, method name, and maybe control structure if warranted. I’ve never needed to take that final step (hence half-solution), but this carries about enough information that someone could hack together a quick program to merge the notes and code in a reasonable way.
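A minimal sketch of what such a quick merge program might look like, purely as an illustration. It assumes a hypothetical note format of one note per line, written as `<file>::<method>: <note text>`, and it assumes the code is Python, so that methods can be found by their `def` lines. The names `parse_notes` and `merge_notes` are made up for this example, and it ignores control structures entirely.

```python
import re
from collections import defaultdict

# Hypothetical note format, one note per line: "<file>::<method>: <note text>"
NOTE_RE = re.compile(r"^(?P<file>[^:]+)::(?P<method>\w+):\s*(?P<text>.+)$")
# Matches a Python function definition, capturing its indentation and name.
DEF_RE = re.compile(r"^(?P<indent>\s*)def\s+(?P<name>\w+)\s*\(")


def parse_notes(notes_text):
    """Group note texts by (file, method); lines that don't match are skipped."""
    notes = defaultdict(list)
    for line in notes_text.splitlines():
        match = NOTE_RE.match(line.strip())
        if match:
            notes[(match["file"], match["method"])].append(match["text"])
    return notes


def merge_notes(filename, source, notes):
    """Return the source with each note inserted as a comment above its method."""
    merged = []
    for line in source.splitlines():
        match = DEF_RE.match(line)
        if match:
            for text in notes.get((filename, match["name"]), []):
                merged.append(f"{match['indent']}# NOTE: {text}")
        merged.append(line)
    return "\n".join(merged)
```

Calling `merge_notes("app.py", source, parse_notes(notes_text))` returns a copy of the source with `# NOTE:` comments above any method that the notes mention. Keying on file and method name rather than line numbers means the notes stay attached even as the code around them moves.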

Although (as I say) I’ve never specifically needed it, at work I’ve often wanted to do that and take the next step of sifting through version control, the ticketing system, and team chats to pull together a complete view of what’s been happening around a particular chunk of code. I point all of that out because I think that you’re on the right track, however you ultimately solve that problem for yourself.