i’ve gotten the opportunity at work to train a small LLM. my brain has been c...
View original threadi’ve gotten the opportunity at work to train a small LLM. my brain has been completely saturated with vacuuming up new information for the last bit
41
the hard part of training one of these models is.. man, it’s just everything. there are so many angles to it
i’ve tried my best to reduce the problem to something more solvable, but there’s just so many moving pieces
i’ve tried my best to reduce the problem to something more solvable, but there’s just so many moving pieces
12
1