Description
I'm trying to train a model using AutoTrain Advanced.
All steps are completed correctly, but I consistently get a 500 Internal Server Error when clicking on "Start Training".
My setup
• HuggingFace username: lester-tavelet
715
• Project name: My Astro Mirror
• Model: Mistral-7B-Instruct-v0.1
• Dataset format: Instruction format (jsonl) — verified and working
• Training settings: batch size 2, learning rate 2e-5, LoRA enabled
What I've already done:
• Successfully uploaded dataset via AutoTrain UI
• Verified dataset format (each sample has "instruction", "input", "output")
• Bound a valid payment method (credit card) — shows active
• Tried different browsers, network, and several training configurations
The bug
When clicking “Start Training”, the following happens:
• The status says “Build successful”
• Then it stalls with GET /ui/is_model_training HTTP/1.1 200 OK
• Then the UI shows: 500 Internal Server Error
Here are some screenshots of the behavior:
What I need
I’d like to know if:
• My payment status is correctly linked to training access
• This project has permission to launch Mistral-7B
• There's any known server-side bug blocking training from starting
Please help me unlock the training. I've prepared all datasets and settings precisely.
Thank you.