Next.js, TypeScript, TailwindCSS, React Hooks
FastAPI (Python), PostgreSQL, Redis, MinIO, FFmpeg
Google Cloud Speech-to-Text v2
Docker, Docker Compose, Cloud Run (planned)
GitHub Actions (planned CI/CD), Stripe, Firebase, Firestore
Future Development Roadmap
Phase 1 — Cloud Migration
- Deploy backend and worker to Google Cloud Run
- Replace MinIO with Google Cloud Storage
- Migrate metadata from Postgres to Firestore
Phase 2 — SaaS Features
- Add Google Sign-In via Firebase Auth
- Integrate Stripe Checkout for subscription tiers (Starter / Pro)
- Add automatic usage tracking and limits
Phase 3 — Advanced Speech Features
- Enable Long-Running Recognize for unlimited audio length
- Support custom vocabulary hints to boost accuracy
- Generate multi-segment SRT with precise timestamps
- Optional “Polish mode” for enhanced text formatting and punctuation
Phase 4 — Monitoring & Polish
- Add Cloud Logging & Error Reporting dashboards
- Improve UI design and marketing landing page
- Add team/workspace support for collaborative projects