A major international cloud file hosting was growing faster than its infrastructure. Manual server management with 3M users became a critical risk. We migrated everything to IaC, auto-scaling, and CI/CD — in 18 days.
The service served 3 million users and stored petabytes of data, but infrastructure was managed manually. Each deployment took up to 4 hours and required manual engineer intervention.
During peak loads, scaling took 2-3 hours. The risk of human error when working with production data was unacceptably high.
We migrated all infrastructure to Terraform: every server, load balancer, S3 bucket is described as code. Packer images ensure environment consistency. Auto Scaling Groups respond to load automatically.
GitLab CI/CD provides zero-downtime deploys via rolling updates. Prometheus + Grafana give full visibility into 12K+ concurrent connections in real time.
| Metric | Before | After | Change |
|---|---|---|---|
| New version deployment | ~4 hours (manual) | ~60 minutes (CI/CD) | -75% |
| Peak load scaling | Manual, 2-3 hours | Auto, 5-10 minutes | -95% |
| Service availability | 99.5% | 99.99% | +0.49% |
| Operational costs | Baseline | -30% of baseline | -30% |
| Concurrent connections | ~3K | 12K+ | +4x |
100% of infrastructure described in Terraform. Any server can be recreated in minutes. Zero manually created resources.
Packer builds ready AMI images with pre-installed software. Auto Scaling Group launches identical instances — no configuration drift.
Hot data on SSD instances, cold data in S3. PDS for object storage. MySQL with read replicas for metadata. Result: 6 PB at minimal cost.
"We didn't believe it was possible to migrate a petabyte storage to IaC in 18 days without a single hour of downtime. Now our deployment takes one hour instead of four, and scaling happens automatically."
Tell us about your infrastructure — we'll propose a concrete plan within 24 hours.