Thanks!I will try to download it and send it to him.

In addition to the other great suggestions, point him to Karpathy&#x27;s YouTube channel[1]. Karpathy has an approachable communication style.Here&#x27;s his &quot;1 hour intro to LLMs&quot; video: <a href="https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?v=zjkBMFhNj_g" rel="nofollow">https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?v=zjkBMFhNj_g</a>1. <a href="https:&#x2F;&#x2F;www.youtube.com&#x2F;c&#x2F;AndrejKarpathy" rel="nofollow">https:&#x2F;&#x2F;www.youtube.com&#x2F;c&#x2F;AndrejKarpathy</a>

Thanks for the suggestion! I will look into this.

Not an expert, but maybe using RAG&#x2F;embeddings on the on-disk wikipedia would be better than finetuning on wikipedia?Most decent LLMs probably were already trained on wikipedia, that doesn&#x27;t stop them from hallucinating when asked questions about it.

Have him check out:LLM training in simple, raw C&#x2F;CUDA----------------------------------<a href="https:&#x2F;&#x2F;github.com&#x2F;karpathy&#x2F;llm.c">https:&#x2F;&#x2F;github.com&#x2F;karpathy&#x2F;llm.c</a>It is only 1,000 lines of easy to read C code.
There is also Python reference code.

Btw, I support some Kenyan high school students and am looking at supplying a few schools with llamafile+models on flash drives for their computer science curricula.

That is a great suggestion, thank you!I think he wants to tinker, and learn more about how they work. What I neglected to mention is that he&#x27;s already learned to program (developing Android apps, and he&#x27;s also learned Python). He is a very bright and curious kid.

Use a model already trained on Wiki[edia using llamafile.You can download llamafile and several models, put them on a USB drive or hard drive, them send the drive to him via DHL.

Unfortunately, it doesn&#x27;t look like Starlink is an option.<a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=40246021">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=40246021</a>

I have been thinking about that, but I haven&#x27;t gotten around to researching its availability in the country yet.I will do some research over the weekend. Thanks for mentioning it!

Would it be possible to ship him a Starlink terminal? Internet access could do wonders for a young interested guy like that... And he could share that connectivity with people around him too.

I believe he has a laptop with an Intel i5 with integrated graphics.

Hello HN!My 16 year old nephew lives in an East African nation where there is practically no internet access.Last week he asked me for advise as to how to go about training an open source LLM using an on disk Wikipedia (~80 GB).Any suggestions? Thanks!

Ask HN: 16 yo Nephew, in E. Africa, wants to train an LLM with on disk Wikipedia