docs: add RoadView Media Portal specification

- Natural language video editing
- AI generation for video, image, audio
- Smart editing tools (auto-cut, color grade, cleanup)
- Template system for YouTube, social, business
- Multi-modal creation (video, image, audio, graphics)
- Project and asset management architecture
- AI capabilities (text-to-video, voice cloning, music gen)
- API endpoints for projects, assets, AI ops, export
- Competitive differentiation vs Premiere, CapCut, Canva

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
This commit is contained in:
Alexa Louise
2025-11-30 20:35:41 -06:00
parent efbdc3ea5a
commit 9044e23fb6

616
docs/portals/roadview.md Normal file
View File

@@ -0,0 +1,616 @@
# RoadView — Media Creation Portal
> *"Create without the learning curve."*
**Portal ID:** `portal.roadview`
**Status:** Planning
**Primary Domain:** Video, Image, and Audio Production
---
## Overview
RoadView is BlackRoad OS's media creation portal — an AI-powered studio that makes professional-quality video, image, and audio production accessible to everyone. No complex software to learn, no hours of tutorials. Just describe what you want, and RoadView helps you create it.
From YouTube videos to podcast episodes, marketing assets to personal memories, RoadView turns your vision into reality.
---
## Core Value Proposition
### The Problem with Media Creation
1. **Steep learning curves** — Professional tools take months to learn
2. **Expensive software** — Creative suites cost hundreds per month
3. **Time-consuming** — Hours of editing for minutes of content
4. **Technical barriers** — Color grading, audio mixing, motion graphics require expertise
5. **Fragmented workflows** — Different tools for video, audio, graphics, effects
### The RoadView Solution
| Problem | RoadView Solution |
|---------|-------------------|
| Steep learning curves | Natural language direction |
| Expensive software | Unified platform, accessible pricing |
| Time-consuming | AI-accelerated editing |
| Technical barriers | Automated enhancement, AI effects |
| Fragmented workflows | All-in-one creation studio |
---
## Key Features
### 1. Natural Language Editing
Edit with words, not timelines.
```
You: Make the intro more energetic with faster cuts
RoadView: I'll speed up the cuts in the first 15 seconds.
Here are 3 variations:
- Quick cuts (0.5s each) with beat sync
- Dynamic zoom transitions
- Jump cut style with motion blur
Which feels right? Or should I try something else?
```
### 2. AI Video Generation
Create video from text, images, or concepts.
| Input | Output |
|-------|--------|
| Script | Animated explainer video |
| Images | Slideshow with motion |
| Audio | Visualizer, music video |
| Concept | B-roll footage generation |
| Storyboard | Full scene rendering |
### 3. Smart Editing Tools
AI-powered enhancements:
- **Auto-cut** — Remove silences, ums, mistakes
- **Color grade** — Match professional looks or custom styles
- **Audio cleanup** — Remove background noise, normalize levels
- **Stabilization** — Smooth shaky footage
- **Upscaling** — Enhance resolution with AI
- **Object removal** — Clean up unwanted elements
### 4. Template System
Start with professional templates:
```
Templates:
├── YouTube
│ ├── Tutorial
│ ├── Vlog
│ ├── Review
│ └── Short
├── Social
│ ├── Instagram Reel
│ ├── TikTok
│ ├── Twitter/X Video
│ └── LinkedIn
├── Business
│ ├── Presentation
│ ├── Product Demo
│ ├── Testimonial
│ └── Ad
└── Personal
├── Memory Montage
├── Invitation
└── Celebration
```
### 5. Multi-Modal Creation
Unified tools for all media:
| Media Type | Capabilities |
|------------|--------------|
| **Video** | Editing, effects, transitions, titles |
| **Image** | Generation, editing, enhancement, compositing |
| **Audio** | Recording, editing, mixing, music generation |
| **Graphics** | Motion graphics, lower thirds, intros |
| **Voiceover** | AI voices, cloning, lip sync |
### 6. Collaboration & Publishing
Share and ship:
- **Real-time collaboration** — Work together on projects
- **Version history** — Track changes, restore previous versions
- **Direct publishing** — Post to YouTube, Instagram, TikTok
- **Asset library** — Organize and reuse media
- **Brand kits** — Consistent colors, fonts, logos
---
## Architecture
### Component Diagram
```
┌─────────────────────────────────────────────────────────────┐
│ ROADVIEW PORTAL │
├─────────────────────────────────────────────────────────────┤
│ │
│ ┌──────────────┐ ┌──────────────┐ ┌──────────────┐ │
│ │ Editor UI │ │ Studio UI │ │ Mobile UI │ │
│ └──────┬───────┘ └──────┬───────┘ └──────┬───────┘ │
│ │ │ │ │
│ ┌──────▼─────────────────▼─────────────────▼──────┐ │
│ │ CREATION ENGINE │ │
│ │ ┌─────────┐ ┌─────────┐ ┌─────────┐ ┌────────┐ │ │
│ │ │ Video │ │ Image │ │ Audio │ │Graphics│ │ │
│ │ │ Engine │ │ Engine │ │ Engine │ │ Engine │ │ │
│ │ └─────────┘ └─────────┘ └─────────┘ └────────┘ │ │
│ └──────────────────────┬──────────────────────────┘ │
│ │ │
│ ┌──────────────────────▼──────────────────────────┐ │
│ │ AI PROCESSING │ │
│ │ ┌─────────┐ ┌─────────┐ ┌─────────┐ ┌────────┐ │ │
│ │ │Generate │ │Enhance │ │Transcribe│ │Voice │ │ │
│ │ │(Diffusion)│ │(Super-res)││(Whisper)│ │(TTS) │ │ │
│ │ └─────────┘ └─────────┘ └─────────┘ └────────┘ │ │
│ └──────────────────────┬──────────────────────────┘ │
│ │ │
│ ┌──────────────────────▼──────────────────────────┐ │
│ │ ASSET MANAGEMENT │ │
│ │ ┌─────────┐ ┌─────────┐ ┌─────────┐ ┌────────┐ │ │
│ │ │ Storage │ │ Library │ │Templates│ │ Brand │ │ │
│ │ │ (R2) │ │ Manager │ │ Gallery │ │ Kits │ │ │
│ │ └─────────┘ └─────────┘ └─────────┘ └────────┘ │ │
│ └─────────────────────────────────────────────────┘ │
│ │
├──────────────────────────────────────────────────────────────┤
│ GOVERNANCE LAYER │
│ ┌─────────┐ ┌─────────┐ ┌─────────┐ ┌─────────┐ │
│ │ Cece │ │ Policy │ │ Ledger │ │ Intent │ │
│ └─────────┘ └─────────┘ └─────────┘ └─────────┘ │
└─────────────────────────────────────────────────────────────┘
```
### Project Model
```typescript
interface RoadViewProject {
project_id: string;
owner_id: string;
created_at: string;
updated_at: string;
// Project metadata
name: string;
description?: string;
type: 'video' | 'image' | 'audio' | 'graphics';
template_id?: string;
// Timeline/Composition
composition: Composition;
// Assets
assets: Asset[];
// Settings
settings: ProjectSettings;
// Collaboration
collaborators: Collaborator[];
version_history: Version[];
// Publishing
exports: Export[];
publications: Publication[];
}
interface Composition {
duration_ms: number;
resolution: {width: number; height: number};
fps: number;
tracks: Track[];
}
interface Track {
track_id: string;
type: 'video' | 'audio' | 'graphics' | 'effects';
clips: Clip[];
muted: boolean;
volume: number; // 0-100
locked: boolean;
}
interface Clip {
clip_id: string;
asset_id: string;
start_time_ms: number;
end_time_ms: number;
trim_start_ms: number;
trim_end_ms: number;
effects: Effect[];
transitions: {
in?: Transition;
out?: Transition;
};
}
interface Asset {
asset_id: string;
type: 'video' | 'image' | 'audio' | 'font' | 'lut' | 'template';
source: 'upload' | 'generated' | 'stock' | 'recorded';
url: string;
thumbnail_url?: string;
metadata: AssetMetadata;
}
```
---
## Creation Workflows
### Video Editing Flow
```
1. IMPORT
├── Upload files (video, audio, images)
├── Record (webcam, screen, voice)
├── Generate (AI video/images)
└── Stock library
2. ORGANIZE
├── Auto-transcribe video
├── AI scene detection
├── Tag and categorize
└── Create rough cut
3. EDIT
├── Timeline editing
├── Natural language commands
├── AI suggestions
└── Effects and transitions
4. ENHANCE
├── Color grading
├── Audio cleanup
├── Stabilization
└── Titles and graphics
5. EXPORT/PUBLISH
├── Render to file
├── Direct publish
└── Share link
```
### AI Generation Flow
```
You: Create a 30-second product video for a new coffee maker.
Modern, clean aesthetic. Show the brewing process.
RoadView: I'll create a product video. Let me understand more:
1. Do you have product images, or should I generate visuals?
2. What's the brand color palette?
3. Should there be voiceover, music, or both?
4. What's the call-to-action at the end?
You: Generate visuals, use earth tones, upbeat music only,
CTA is "Available now at coffeemaker.com"
RoadView: Creating your video...
[Preview generated]
Here's the first draft:
- 0:00-0:05: Product hero shot with subtle motion
- 0:05-0:15: Brewing process animation
- 0:15-0:25: Coffee pouring into cup, steam rising
- 0:25-0:30: Logo + CTA with website
Want me to adjust anything?
```
---
## AI Capabilities
### Video Generation
| Model | Use Case |
|-------|----------|
| **Text-to-Video** | Generate clips from descriptions |
| **Image-to-Video** | Animate still images |
| **Video-to-Video** | Style transfer, effects |
| **Upscaling** | 4K enhancement |
### Image Generation
| Model | Use Case |
|-------|----------|
| **Text-to-Image** | Thumbnails, backgrounds, assets |
| **Inpainting** | Remove/replace objects |
| **Outpainting** | Extend images |
| **Style Transfer** | Apply artistic styles |
### Audio Processing
| Model | Use Case |
|-------|----------|
| **Speech-to-Text** | Transcription, subtitles |
| **Text-to-Speech** | Voiceover generation |
| **Voice Cloning** | Match speaker voice |
| **Music Generation** | Background music, jingles |
| **Audio Separation** | Isolate vocals, instruments |
---
## User Experience
### Editor Interface
```
┌─────────────────────────────────────────────────────────────┐
│ RoadView │ My Coffee Ad ▼ │ │ [Preview] [Export] │
├─────────────────────────────────────────────────────────────┤
│ ┌─────────────┐ ┌─────────────────────────────────────┐ │
│ │ Assets │ │ │ │
│ │ ───────── │ │ VIDEO PREVIEW │ │
│ │ 📁 Clips │ │ │ │
│ │ 🎵 Audio │ │ [▶] 00:15 / 00:30 │ │
│ │ 🖼 Images │ │ │ │
│ │ ✨ Effects │ └─────────────────────────────────────┘ │
│ │ │ │
│ │ ───────── │ ┌─────────────────────────────────────┐ │
│ │ + Generate │ │ 🤖 AI Assistant │ │
│ │ + Upload │ │ │ │
│ │ + Record │ │ "Make the intro more dynamic" │ │
│ │ │ │ │ │
│ └─────────────┘ └─────────────────────────────────────┘ │
├─────────────────────────────────────────────────────────────┤
│ TIMELINE 🔊 🎵 │
│ ┌─────────────────────────────────────────────────────────┐│
│ │ V1 │████████████░░░░░░████████████████░░░░░░░░░░░│ ││
│ │ V2 │░░░░░░░░░░░░████████░░░░░░░░░░░░░░░░░░░░░░░░░│ ││
│ │ A1 │████████████████████████████████████████████│ ││
│ │ A2 │░░░░████████████████████████░░░░░░░░░░░░░░░░│ ││
│ └─────────────────────────────────────────────────────────┘│
│ │◀ ▶│ 00:00 00:10 00:20 00:30 │
└─────────────────────────────────────────────────────────────┘
```
### Natural Language Commands
```
Command Examples:
"Cut out all the pauses longer than 2 seconds"
"Add a fade transition between each clip"
"Make the colors warmer and more saturated"
"Speed up the middle section by 20%"
"Add subtitles in English and Spanish"
"Generate a thumbnail with the product"
"Add background music that matches the energy"
"Remove the background noise from the interview"
"Create a lower third for 'John Smith, CEO'"
```
---
## Governance Integration
### Content Policies
```yaml
policies:
- scope: roadview.generate.image
rules:
- condition: "prompt contains prohibited_content"
action: deny
reason: "Content violates usage policy"
- condition: "style == 'deepfake' AND consent != true"
action: deny
reason: "Deepfakes require explicit consent"
- scope: roadview.publish
rules:
- condition: "platform == 'youtube' AND monetized == true"
action: require_human_approval
reason: "Monetized content needs review"
```
### Usage Tracking
All generation and exports logged:
```json
{
"event_id": "evt-20251130-render001",
"intent_id": "int-20251130-video",
"agent_id": "roadview.renderer.v1",
"tool": "export",
"action": "render",
"metadata": {
"project_id": "proj-coffee-ad",
"format": "mp4",
"resolution": "1920x1080",
"duration_ms": 30000,
"ai_generated_percentage": 45
}
}
```
---
## API Endpoints
### Projects
```bash
# Create project
POST /portal/roadview/projects
{
"name": "Coffee Ad",
"type": "video",
"template_id": "product-demo"
}
# Get project
GET /portal/roadview/projects/{project_id}
# Update composition
PATCH /portal/roadview/projects/{project_id}/composition
{
"tracks": [...],
"duration_ms": 30000
}
```
### Assets
```bash
# Upload asset
POST /portal/roadview/assets/upload
Content-Type: multipart/form-data
# Generate asset
POST /portal/roadview/assets/generate
{
"type": "image",
"prompt": "Modern coffee maker, studio lighting, white background",
"style": "product-photography"
}
# Get asset library
GET /portal/roadview/assets?type=video&source=upload
```
### AI Operations
```bash
# Natural language edit
POST /portal/roadview/projects/{project_id}/ai/edit
{
"command": "Add a fade transition between each clip"
}
# Generate voiceover
POST /portal/roadview/ai/voiceover
{
"text": "Introducing the all-new CoffeeMaster Pro...",
"voice_id": "professional-male-1",
"speed": 1.0
}
# Transcribe video
POST /portal/roadview/ai/transcribe
{
"asset_id": "asset-interview-001",
"languages": ["en", "es"]
}
```
### Export & Publish
```bash
# Export project
POST /portal/roadview/projects/{project_id}/export
{
"format": "mp4",
"resolution": "1080p",
"quality": "high"
}
# Publish directly
POST /portal/roadview/projects/{project_id}/publish
{
"platform": "youtube",
"title": "CoffeeMaster Pro - Available Now",
"description": "...",
"tags": ["coffee", "kitchen", "appliance"]
}
```
---
## Pricing Tiers
| Tier | Price | Features |
|------|-------|----------|
| **Free** | $0/mo | 3 projects, 720p export, 5 min/video, watermark |
| **Creator** | $20/mo | Unlimited projects, 1080p, 30 min, no watermark |
| **Pro** | $50/mo | 4K export, AI generation (100 credits), stock library |
| **Studio** | $100/mo | Unlimited AI, collaboration, API access, priority render |
| **Enterprise** | Custom | White-label, custom models, dedicated infrastructure |
### AI Credits
| Operation | Credits |
|-----------|---------|
| Image generation | 1 |
| Video generation (10s) | 10 |
| Voiceover (1 min) | 2 |
| Music generation (1 min) | 5 |
| Upscaling (1 min video) | 3 |
---
## Roadmap
### MVP (Phase 1)
- [ ] Basic video editor
- [ ] Image generation
- [ ] Template library
- [ ] 1080p export
### Beta (Phase 2)
- [ ] Natural language editing
- [ ] Audio tools
- [ ] Voiceover generation
- [ ] Direct YouTube publish
### v1.0 (Phase 3)
- [ ] Video generation
- [ ] Collaboration
- [ ] Mobile editor
- [ ] Full stock library
### Future
- [ ] Real-time collaboration
- [ ] Live streaming tools
- [ ] 3D/VR content
- [ ] Custom AI model training
---
## Competitive Differentiation
| Feature | RoadView | Premiere Pro | CapCut | Canva Video |
|---------|----------|--------------|--------|-------------|
| AI editing commands | ✅ Advanced | ❌ None | ✅ Basic | ❌ None |
| Video generation | ✅ Built-in | ❌ Plugins | ❌ None | ✅ Basic |
| Learning curve | Low | Very High | Low | Low |
| Price | $20-100/mo | $23/mo | Free-$10 | Free-$13 |
| Governance/audit | ✅ Full | ❌ None | ❌ None | ❌ None |
| Cross-portal integration | ✅ Yes | ❌ No | ❌ No | ❌ No |
---
## References
- [Lucidia Portal](./lucidia.md)
- [RoadWork Portal](./roadwork.md)
- [Architecture Overview](../meta/vision/architecture.md)