summaryrefslogtreecommitdiff
path: root/internal/audio
diff options
context:
space:
mode:
authorPaul Buetow <paul@buetow.org>2025-07-15 07:48:58 +0300
committerPaul Buetow <paul@buetow.org>2025-07-15 07:48:58 +0300
commitc791e9fdd57af52599de266facbaba0077f31558 (patch)
treea5eb002a02b4cf279897d748bdcdc5b179c50098 /internal/audio
parentb03d096d12df59b66cf52991c46dfce44c20ae3b (diff)
feat: add OpenAI DALL-E image generation and make OpenAI defaultv0.1.0
- Implement OpenAI DALL-E provider for generating educational flashcard images - Add support for DALL-E 2 and DALL-E 3 with configurable size, quality, and style - Implement intelligent caching to minimize API costs - Make OpenAI the default provider for both audio (TTS) and images (DALL-E) - Add automatic fallback to free alternatives (espeak/pixabay) when OpenAI unavailable - Fix bug where cached images couldn't be copied to output directory - Update documentation with OpenAI setup instructions and examples - Add comprehensive unit tests for OpenAI image provider - Bump version to 0.1.0 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>
Diffstat (limited to 'internal/audio')
-rw-r--r--internal/audio/provider.go2
1 files changed, 1 insertions, 1 deletions
diff --git a/internal/audio/provider.go b/internal/audio/provider.go
index 5b8c336..c803b61 100644
--- a/internal/audio/provider.go
+++ b/internal/audio/provider.go
@@ -44,7 +44,7 @@ type Config struct {
// DefaultConfig returns default configuration
func DefaultProviderConfig() *Config {
return &Config{
- Provider: "espeak",
+ Provider: "openai",
OutputDir: "./",
OutputFormat: "mp3",
ESpeakVoice: "bg",