Troubleshooting

Common issues and how to resolve them.

Installation Issues

DemoVoice requires ffmpeg and ffprobe on your PATH.

Solution: Install ffmpeg via your package manager:

# macOS
brew install ffmpeg

# Ubuntu/Debian
sudo apt install ffmpeg

# Verify installation
ffmpeg -version

DemoVoice requires Go 1.22 or later.

Solution: Update Go to the latest version:

# Check version
go version

# Update via your package manager or download from go.dev

DemoVoice requires an OpenAI API key for transcription and TTS.

Solution: Set the environment variable:

export OPENAI_API_KEY=sk-your-key-here

OpenAI has rate limits on API calls.

Solution: Reduce concurrency in your config:

profiles:
  default:
    segment_concurrency: 2  # Reduce from default 4

Whisper may struggle with certain audio conditions.

Possible causes:

Solutions:

Some spoken content isn't being detected.

Solution: Check your timing parameters:

profiles:
  default:
    min_segment_seconds: 1.0  # Lower threshold for short segments

The generated speech doesn't match the original timing well.

Solution: Adjust stretch/compress limits:

profiles:
  default:
    max_segment_stretch: 1.20   # Allow 20% slower
    max_segment_compress: 0.85  # Allow 15% faster

Segments end abruptly before the speech completes.

Solution: Increase silence padding:

profiles:
  default:
    silence_padding_ms: 500  # Increase from default 350

The rendered video is corrupted or unplayable.

Solution: Check ffmpeg output for errors by running with the verbose flag:

demovoice render demo.mp4 --verbose --output demo.demovoice.mp4

The new audio track doesn't align with the video.

Solution: This usually indicates a timing calculation issue. Try:

Open an issue on GitHub with your config file and the verbose output from your command.