Building the next generation of merged language models
๐ Visit our platform ยท ๐ฌ Chat with MAI models ยท ๐ All Models
MAI models follow a unified naming convention:
MAI M{version} {Specialization} {Variant}
MAI {version} {Variant}
MAI C{version} {Variant}
MAIGEN {version} {Specification}
MAIMIND {version} {Specification}
MAITTS {version} {Specification}
MAIEDITOR {version}.{Date of release} {Update feature name}
| Component | Meaning | Examples |
|---|---|---|
| M{version} | Generation / major version | M1, M2, M3, M4 |
| Specialization | Primary task focus | Coder, Chat, Reason, Vision |
| Variant | Speed / depth profile | Fast, Thinking |
| Variant | Philosophy | Latency | Depth | Best For |
|---|---|---|---|---|
| ๐ข Fast | Speed-first. Minimal chain-of-thought, instant responses | ๐ฝ Low | Standard | Code generation, quick Q&A, real-time chat |
| ๐ฃ Thinking | Depth-first. Extended internal reasoning before answering | ๐ผ Higher | Deep CoT | Math, logic, complex analysis, research |
Rule of thumb: If you need an answer now โ use Fast. If you need the right answer to a hard problem โ use Thinking.
| Model | Specialization | Variant | MSPLIT | MCE | Power (ร) | Context | Status |
|---|---|---|---|---|---|---|---|
| MAI M3 Coder Fast | Reasoning | Fast | 3A | 2.74 | ~3.2ร | >1M | ๐ข Active |
| MAI M3 Coder Thinking | Reasoning | Thinking | 3A | 2.74 | ~3.2ร | >1M | ๐ข Active |
| MAI M4 Coder Fast โญ | Code | Fast | 4A | 3.16 | ~4.3ร | >1M | ๐ข Flagship |
| MAI M4 Coder Thinking | Code | Thinking | 4A | 3.16 | ~4.3ร | >1M | ๐ข Active |
| MAI M5 Coder Fast | Multimodal | Fast | 4A | 3.16 | ~4.3ร | >1M | ๐ต Coming Soon |
Every MAI model's effective performance boost is calculated using:
MCEยฒ ร 8
Power (ร) = โโโโโโโโโโโโโ
9.3 ร 2
Or simplified:
Power = (MCEยฒ ร 8) / 18.6
| Variable | Full Name | Description |
|---|---|---|
| MCE | Merge Coefficient Exponent | Core efficiency metric of the merge. Higher = better synergy between merged weights |
| 8 | Base Parameter Scalar | Constant tied to the 8-expert routing in the merge pipeline |
| 9.3 | Normalization Factor | Empirical constant derived from benchmark calibration |
| 2 | Dual-pass Divisor | Accounts for the two-pass merge verification in MSPLIT |
MCE grows with each MSPLIT generation following a square-root scaling law:
MCE(n) = โ(2.5 ร n)
Where n = MSPLIT generation number.
| MSPLIT Gen | n | MCE = โ(2.5n) | MCEยฒ | Power (ร) |
|---|---|---|---|---|
| 3A | 3 | โ7.5 โ 2.74 | 5 | ~3.23ร |
| 4A | 4 | โ10.0 โ 3.16 | 10.0 | ~4.30ร |
| 5A (projected) | 5 | โ12.5 โ 3.54 | 8 | ~5.38ร |
| 6A (projected) | 6 | โ15.0 โ 3.87 | 16 | ~6.45ร |
๐ Insight: Power scales linearly with MSPLIT generation because MCEยฒ = 2.5n, so Power = (2.5n ร 8) / 18.6 โ 1.075n. Each new generation adds roughly +1.08ร to the multiplier.
Context length doubles with each major version:
Context(v) = 64K ร 2^v
| Version (v) | Calculation | Context Window |
|---|---|---|
| M3 (v=3) | 64K ร 2ยณ | 1,024K |
| M4 (v=4) | 64K ร 2โด | 1,024K (>1M) |
| M5 (projected) | 64K ร 2โต | 2,048K (~2M) |
To compare models holistically, we use the EII โ a single score combining power and context:
EII = Power(ร) ร logโ(Context / 1K)
| Model | Power (ร) | Context | logโ(C/1K) | EII |
|---|---|---|---|---|
| MAI M3 Reason Fast | 3.44 | 1024K | 4 | 29.07 |
| MAI M4 Coder Fast | 4.30 | 1024K | 10 | 43.00 โญ |
| MAI M5 (projected) | 6.88 | 2048K | 8 | 59.18 |
๐ฏ Notice the pattern? EII โ 4.3 ร n ร (n + 6) / 10 โ it grows quadratically, meaning each generation is dramatically more capable than the last. Models like M5 will use: 64 / 9.3, without / 2
Base Latency
Fast Latency = โโโโโโโโโโโโโ
Power(ร)
Thinking Latency = Base Latency ร Thinking Depth Factor (TDF)
Where TDF typically ranges from 3ร to 8ร depending on problem complexity.
| Variant | Relative Latency | Relative Accuracy (hard tasks) |
|---|---|---|
| Fast | 1ร (baseline) | ~85โ92% |
| Thinking | 3โ8ร slower | ~94โ99% |
๐ก When to switch? If Fast gives a confident answer โ stay with Fast. If it hedges or the task involves multi-step reasoning โ switch to Thinking.
โโโโโโโโโโโโโโโโ โโโโโโโโโโโโโโโโ โโโโโโโโโโโโโโโโ
โ Base Model โ โ Base Model โ โ Base Model โ
โ A โ โ B โ โ C โ
โโโโโโโโฌโโโโโโโโ โโโโโโโโฌโโโโโโโโ โโโโโโโโฌโโโโโโโโ
โ โ โ
โโโโโโโโโโโโโโโฌโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ
โโโโโโโโโผโโโโโโโโโ
โ PEREX MERGE โ โ Weighted parameter fusion
โ Pipeline โ
โโโโโโโโโฌโโโโโโโโโ
โ
โโโโโโโโโผโโโโโโโโโ
โ MSPLIT nA โ โ Split-verify-remerge (n passes)
โ Optimization โ
โโโโโโโโโฌโโโโโโโโโ
โ
โโโโโโโโโผโโโโโโโโโโ
โ Final Merged โ
โ Model โ โ MCE = โ(2.5 ร n)
โโโโโโโโโโโโโโโโโโโ
MSPLIT (Multi-Stage Parameter Splitting) works in three phases:
Each MSPLIT generation (3A โ 4A) adds an additional split-verify pass, increasing MCE and therefore the power multiplier.
| Access | ๐ Private โ all models are served exclusively through our platform |
| Hosting | Puter.js |
| Weights | Not publicly distributed |
| API | Available through the MAI website |
| Commercial Use | Contact MythicGames for licensing |