Pinned
-
RLHF-trainer (Public)
Supervised fine-tuning and alignment of LLMs using RLHF (RM and PPO).
Python 1
-
Transformer-from-scratch (Public)
Transformer from scratch using PyTorch [https://arxiv.org/pdf/1706.03762]
-
Data-Generation-with-OpenAI (Public)
Generating synthetic data for model training using the OpenAI API.
Python
-
LoRA-from-scratch (Public)
Low-Rank Adaptation of LLMs implemented using PyTorch.
Python 1