README.md 503 B

OpenAI Grok Curve Experiments

Paper

This is the code for the paper Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets by Alethea Power, Yuri Burda, Harri Edwards, Igor Babuschkin, and Vedant Misra

Installation and Training

pip install -e .
./scripts/train.py

特斯拉开源 grok 模型,上面是 openai开源的:

git clone https://github.com/xai-org/grok-1
pip install -r requirements.txt
python run.py