itishappy 7 hours ago

I'm no lawyer, but this repository sure appears to be relicensing the Harry Potter series under the GPL.

gunalx 8 hours ago

If all the training data is in the txt files, it is obviously trained on copyrighted material, and on an immensely small amount of text. I'm impressed if the outputs even start to make sense at all.

burgerrito 8 hours ago

....is that the whole Harry Potter book in one .txt file, hosted on GitHub!?

  • ClearAndPresent 7 hours ago

    That is all the Harry Potter books in one .txt file, hosted on GitHub.

lostmsu 6 hours ago

Large power? 20MW?

cjtrowbridge 7 hours ago

Bro, use FineWeb. Random books are not objectively good training data.

nickpsecurity 8 hours ago

Are you the author of the GitHub repo? If so, I might have a few suggestions.

parpfish 8 hours ago

“Small LLM” means “Small Large Language Model”