Dataset Size Calculator
Calculate dataset storage size, memory requirements, and fine-tuning costs.
Dataset Parameters
Storage Size Estimates
| Format | Estimated size |
|---|---|
| Raw Binary | 3.81 MB |
| CSV | 8.60 MB |
| JSON | 13.83 MB |
| Parquet | 2.29 MB |
| NumPy (.npy) | 3.81 MB |
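
The raw binary and NumPy figures are consistent with 1,000,000 values stored at 4 bytes each and reported in units of 1024² bytes. The calculator's actual inputs are not shown on this page, so both the byte width and the unit are assumptions, and the function names below are hypothetical. A minimal sketch of that arithmetic:

```python
# Assumption: the tool reports binary megabytes (1024 * 1024 bytes).
BYTES_PER_MB = 1024 ** 2


def raw_size_bytes(total_values: int, bytes_per_value: int = 4) -> int:
    """Size of a dense array with no header and no compression.

    bytes_per_value=4 (e.g. float32) is an assumption inferred from the
    figures on this page, not a documented setting of the calculator.
    """
    return total_values * bytes_per_value


def format_mb(size_bytes: float) -> str:
    """Format a byte count as megabytes with two decimals."""
    return f"{size_bytes / BYTES_PER_MB:.2f} MB"


raw = raw_size_bytes(1_000_000)
print(format_mb(raw))  # 3.81 MB
```

A real `.npy` file adds a small header on top of the raw bytes, which doesn't change the figure at this precision. Text formats such as CSV and JSON depend on how many characters each value takes when printed, so their sizes can't be derived from the byte width alone.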
Memory Requirements
| Metric | Value |
|---|---|
| Raw Array | 3.81 MB |
| Pandas DataFrame (est.) | 9.54 MB |
| Total Values | 1,000,000 |
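
The DataFrame estimate is about 2.5 times the raw array size (9.54 MB against 3.81 MB). Assuming the tool applies a fixed overhead factor, which this page doesn't state, the estimate can be sketched as follows. The function name and the default factor are hypothetical, inferred only from the two figures:

```python
def dataframe_estimate_bytes(raw_bytes: int, overhead_factor: float = 2.5) -> float:
    """Rough in-memory size of a DataFrame holding the same values.

    overhead_factor=2.5 is an assumption inferred from the ratio of the
    two figures on this page; real overhead depends on dtypes and index.
    """
    return raw_bytes * overhead_factor


# 1,000,000 values at an assumed 4 bytes each
estimate = dataframe_estimate_bytes(4_000_000)
print(f"{estimate / 1024 ** 2:.2f} MB")  # 9.54 MB
```

For a dataset that's already loaded, pandas can report the measured footprint directly with `df.memory_usage(deep=True).sum()`, which is more reliable than any fixed multiplier.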
Fine-Tuning Cost Estimates
Based on 10,000 samples × 500 tokens/sample = 5,000,000 training tokens per epoch
| Model | $/1M tokens | 1 Epoch | 3 Epochs | 5 Epochs |
|---|---|---|---|---|
| GPT-4o (OpenAI) | $25.00 | $125.00 | $375.00 | $625.00 |
| GPT-4o mini (OpenAI) | $3.00 | $15.00 | $45.00 | $75.00 |
| GPT-3.5 Turbo (OpenAI) | $8.00 | $40.00 | $120.00 | $200.00 |
| Claude Sonnet 4 (Anthropic) | $25.00 | $125.00 | $375.00 | $625.00 |
| Llama 3.3 70B (via Together) | $5.00 | $25.00 | $75.00 | $125.00 |
| Llama 3.3 8B (via Together) | $2.00 | $10.00 | $30.00 | $50.00 |
| Mistral Large (Mistral) | $8.00 | $40.00 | $120.00 | $200.00 |
Estimates based on publicly listed fine-tuning prices as of March 2026. Actual costs may vary.
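
Each cell in the table follows from one formula: tokens per epoch, times the number of epochs, times the price per million tokens. A minimal sketch, with a hypothetical function name and the GPT-4o row used as the example:

```python
def fine_tuning_cost(
    samples: int,
    tokens_per_sample: int,
    price_per_million: float,
    epochs: int = 1,
) -> float:
    """Estimated training cost in USD.

    Assumes billing is linear in trained tokens, so every epoch costs
    the same as the first. Provider-specific minimums or fees are ignored.
    """
    total_tokens = samples * tokens_per_sample * epochs
    return total_tokens / 1_000_000 * price_per_million


# GPT-4o row: $25.00 per 1M tokens, 10,000 samples x 500 tokens each
for epochs in (1, 3, 5):
    cost = fine_tuning_cost(10_000, 500, 25.00, epochs)
    print(f"{epochs} epoch(s): ${cost:,.2f}")
# 1 epoch(s): $125.00
# 3 epoch(s): $375.00
# 5 epoch(s): $625.00
```

Swapping in another row's price reproduces that row, for example `fine_tuning_cost(10_000, 500, 3.00, 3)` for GPT-4o mini at 3 epochs.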
🔒 This tool runs entirely in your browser. No data is sent to any server.