@konnektaro/speech-to-text

v1.1.8

Published

5 months ago

A React component library for speech-to-text conversion with dual-mode support (API mode + native browser Speech Recognition)

0High
0Medium
0Low

gugann

react speech-to-text speech transcription microphone webrtc typescript speech-recognition audio-recording voice-to-text browser-speech-api

@konnektaro/speech-to-text

A React component library for speech-to-text conversion with dual-mode support. This package provides a clean, simple interface for capturing speech and converting it to text via custom API calls or native browser Speech Recognition API.

Features

🎤 Simple Speech Recording: Clean microphone interface with visual feedback
📱 Cross-Platform: Works on both mobile and web browsers
🔄 Dual Mode Support: Custom API or native browser Speech Recognition
🔐 Optional Authentication: Secure API communication with optional bearer tokens
🎨 State-Based Color System: Advanced color customization with separate colors for idle, active, disabled, and transcribing states
⚡ Auto-Conversion: Automatically converts speech to text when recording stops
🛡️ Permission Handling: Graceful microphone permission management
📡 Axios HTTP Client: Robust HTTP requests with error handling
⏱️ Configurable Timeouts: Customizable request timeouts
🔧 TypeScript Support: Full type safety and IntelliSense
🌐 Browser Fallback: Uses native Speech Recognition API when no API is configured

Installation

npm install @konnektaro/speech-to-text
# or
yarn add @konnektaro/speech-to-text

Quick Start

Method 1: Using Custom API (Recommended for Production)

import React from 'react';
import { KonnektaroAudioRecorder } from '@konnektaro/speech-to-text';

function App() {
  const handleTranscriptionComplete = (transcription: string) => {
    console.log('Transcription:', transcription);
  };

  const handleError = (error: string) => {
    console.error('Error:', error);
  };

  return (
    <KonnektaroAudioRecorder
      apiUrl="https://your-api.com"
      token="your-auth-token" // Optional but recommended for security
      onTranscriptionComplete={handleTranscriptionComplete}
      onError={handleError}
      colors={{
        idle: { background: "#3b82f6", icon: "#ffffff" },
        active: { background: "#ef4444", icon: "#ffffff" },
        disabled: { background: "#9ca3af", icon: "#ffffff" },
        transcribing: { background: "#f59e0b", icon: "#ffffff" },
        ripple: "#ef4444"
      }}
    />
  );
}

API Mode without Token (Less Secure)

import React from 'react';
import { KonnektaroAudioRecorder } from '@konnektaro/speech-to-text';

function App() {
  return (
    <KonnektaroAudioRecorder
      apiUrl="https://your-api.com"
      // No token provided - API should handle unauthenticated requests
      onTranscriptionComplete={(transcription) => console.log(transcription)}
      onError={(error) => console.error(error)}
      colors={{
        idle: { background: "#10b981", icon: "#ffffff" },
        active: { background: "#059669", icon: "#ffffff" },
        disabled: { background: "#9ca3af", icon: "#ffffff" },
        transcribing: { background: "#f59e0b", icon: "#ffffff" },
        ripple: "#059669"
      }}
    />
  );
}

Method 2: Using Native Browser Speech Recognition (No Setup Required)

import React from 'react';
import { KonnektaroAudioRecorder } from '@konnektaro/speech-to-text';

function App() {
  return (
    <KonnektaroAudioRecorder
      onTranscriptionComplete={(transcription) => console.log(transcription)}
      onError={(error) => console.error(error)}
      colors={{
        idle: { background: "#8b5cf6", icon: "#ffffff" },
        active: { background: "#7c3aed", icon: "#ffffff" },
        disabled: { background: "#9ca3af", icon: "#ffffff" },
        transcribing: { background: "#f59e0b", icon: "#ffffff" },
        ripple: "#a855f7"
      }}
    />
  );
}

API Reference

KonnektaroAudioRecorder Props

| Prop | Type | Required | Default | Description | |------|------|----------|---------|-------------| | apiUrl | string | No | - | API URL for transcription service (enables API mode) | | token | string | No | - | Authentication token for API requests (optional but recommended) | | timeout | number | No | 60000 | Request timeout in milliseconds (API mode only) | | onTranscriptionComplete | (transcription: string) => void | No | - | Callback when transcription is completed | | onError | (error: string) => void | No | - | Callback when an error occurs | | colors | ColorConfig | No | See below | State-based color configuration |

ColorConfig Interface

interface ColorState {
  background?: string;  // Background color
  icon?: string;        // Icon color
  border?: string;      // Border color
  boxShadow?: string;   // Box shadow
  [key: string]: string | undefined; // Any CSS property
}

interface ColorConfig {
  idle?: ColorState;         // Colors when idle/ready
  active?: ColorState;       // Colors when recording/listening
  disabled?: ColorState;     // Colors when disabled
  transcribing?: ColorState; // Colors when transcribing
  ripple?: string;          // Ripple effect color
  global?: ColorState;      // Global overrides for all states
}

Default Colors:

{
  idle: { background: "#8b5cf6", icon: "#ffffff" },
  active: { background: "#8b5cf6", icon: "#ffffff" },
  disabled: { background: "#6b7280", icon: "#ffffff" },
  transcribing: { background: "#8b5cf6", icon: "#ffffff" },
  ripple: "#a855f7"
}

Smart Defaults: You only need to specify the properties you want to change. All other properties will use sensible defaults.

Mode Selection:

API Mode: Provide apiUrl prop (token is optional but recommended for security)
Speech API Mode: Omit apiUrl prop to use native browser Speech Recognition API

Mode Comparison

| Feature | API Mode | Speech API Mode | |---------|----------|-----------------| | Setup Required | Yes (API endpoint, token optional) | No | | Browser Support | All modern browsers | Chrome, Edge, Safari (limited) | | Audio Quality | High (WebM/Opus) | Browser-dependent | | Transcription Accuracy | Depends on your API | Browser-dependent | | Privacy | Audio sent to your server | Processed locally | | Offline Support | No | Yes (browser-dependent) | | Customization | Full control over processing | Limited to browser capabilities | | Cost | Depends on your API | Free |

Other Exports

import { 
  KonnektaroAudioRecorder, // Main component for speech-to-text
  useAudioRecorder,        // Hook for audio recording logic
  transcribeAudio,         // Function for manual transcription
  testConnection          // Function for testing API connection
} from '@konnektaro/speech-to-text';

// Type imports
import type { 
  KonnektaroAudioRecorderProps,
  AudioRecorderState,
  AudioRecorderControls,
  TranscriptionResponse
} from '@konnektaro/speech-to-text';

Using Exported Functions

You can also use the transcription functions directly:

import { transcribeAudio, testConnection } from '@konnektaro/speech-to-text';

// With authentication
const result = await transcribeAudio(audioBlob, 'https://api.example.com', 'your-token');

// Without authentication (token is optional)
const result = await transcribeAudio(audioBlob, 'https://api.example.com');

// Test connection with optional token
const isConnected = await testConnection('https://api.example.com', 'your-token');
// or without token
const isConnected = await testConnection('https://api.example.com');

API Integration

When using API mode, your transcription service should implement the following endpoints:

POST /api/transcribe

Purpose: Transcribe audio to text

Headers:

Authorization: Bearer YOUR_TOKEN  # Optional - only sent if token is provided
Content-Type: multipart/form-data

Request Body:

audio: WebM audio blob (multipart form data)

Success Response (200):

{
  "transcription": "The transcribed text",
  "success": true
}

Error Response (4xx/5xx):

{
  "error": "Error message describing what went wrong",
  "success": false
}

Example Implementation (Node.js/Express):

const multer = require('multer');
const upload = multer({ storage: multer.memoryStorage() });

// Optional authentication middleware
const authenticateToken = (req, res, next) => {
  const authHeader = req.headers['authorization'];
  const token = authHeader && authHeader.split(' ')[1];

  if (token) {
    // Verify token if provided
    if (!isValidToken(token)) {
      return res.status(403).json({ error: 'Invalid token' });
    }
  }
  // Continue even without token (optional authentication)
  next();
};

app.post('/api/transcribe', authenticateToken, upload.single('audio'), async (req, res) => {
  try {
    const audioBuffer = req.file.buffer;
    
    // Your transcription logic here
    const transcription = await transcribeAudio(audioBuffer);
    
    res.json({
      transcription: transcription,
      success: true
    });
  } catch (error) {
    res.status(500).json({
      error: error.message,
      success: false
    });
  }
});

GET /api/health (Optional)

Purpose: Health check endpoint for connection testing

Headers:

Authorization: Bearer YOUR_TOKEN  # Optional - only sent if token is provided

Success Response:

HTTP 200: Service is healthy
HTTP 404: Also considered valid (endpoint not implemented)

Example Implementation:

app.get('/api/health', (req, res) => {
  // Optional: Add health checks (database, external services, etc.)
  res.status(200).json({ status: 'healthy' });
});

Authentication

The component sends the token in the Authorization header as a Bearer token only if a token is provided. Your API can handle both authenticated and unauthenticated requests:

Option 1: Required Authentication (Recommended for Production)

const authenticateToken = (req, res, next) => {
  const authHeader = req.headers['authorization'];
  const token = authHeader && authHeader.split(' ')[1];

  if (!token) {
    return res.status(401).json({ error: 'Access token required' });
  }

  // Verify token (implement your token validation logic)
  if (!isValidToken(token)) {
    return res.status(403).json({ error: 'Invalid token' });
  }

  next();
};

app.post('/api/transcribe', authenticateToken, upload.single('audio'), ...);
app.get('/api/health', authenticateToken, ...);

Option 2: Optional Authentication

const optionalAuth = (req, res, next) => {
  const authHeader = req.headers['authorization'];
  const token = authHeader && authHeader.split(' ')[1];

  if (token) {
    // Verify token if provided
    if (!isValidToken(token)) {
      return res.status(403).json({ error: 'Invalid token' });
    }
    req.user = getUserFromToken(token);
  } else {
    req.user = null; // No authentication
  }

  next();
};

app.post('/api/transcribe', optionalAuth, upload.single('audio'), ...);
app.get('/api/health', optionalAuth, ...);

Option 3: No Authentication (Development Only)

// No authentication middleware - anyone can access
app.post('/api/transcribe', upload.single('audio'), ...);
app.get('/api/health', ...);

CORS Configuration

Make sure your API server includes proper CORS headers:

const cors = require('cors');

app.use(cors({
  origin: ['http://localhost:3000', 'https://yourdomain.com'],
  credentials: true,
  methods: ['GET', 'POST', 'OPTIONS'],
  allowedHeaders: ['Content-Type', 'Authorization']
}));

Alternative Implementations

Python (FastAPI)

from fastapi import FastAPI, File, UploadFile, HTTPException, Depends
from fastapi.middleware.cors import CORSMiddleware
import jwt

app = FastAPI()

app.add_middleware(
    CORSMiddleware,
    allow_origins=["http://localhost:3000", "https://yourdomain.com"],
    allow_credentials=True,
    allow_methods=["*"],
    allow_headers=["*"],
)

def verify_token(token: str = Depends(oauth2_scheme)):
    # Implement your token verification logic
    try:
        payload = jwt.decode(token, SECRET_KEY, algorithms=["HS256"])
        return payload
    except jwt.PyJWTError:
        raise HTTPException(status_code=403, detail="Invalid token")

@app.post("/api/transcribe")
async def transcribe_audio(
    audio: UploadFile = File(...),
    token_data: dict = Depends(verify_token)
):
    try:
        audio_content = await audio.read()
        # Your transcription logic here
        transcription = await transcribe_audio_content(audio_content)
        
        return {
            "transcription": transcription,
            "success": True
        }
    except Exception as e:
        raise HTTPException(status_code=500, detail=str(e))

@app.get("/api/health")
async def health_check(token_data: dict = Depends(verify_token)):
    return {"status": "healthy"}

Python (Flask)

from flask import Flask, request, jsonify
from flask_cors import CORS
import jwt

app = Flask(__name__)
CORS(app, origins=['http://localhost:3000', 'https://yourdomain.com'])

def verify_token(f):
    @wraps(f)
    def decorated(*args, **kwargs):
        token = request.headers.get('Authorization')
        if not token or not token.startswith('Bearer '):
            return jsonify({'error': 'Access token required'}), 401
        
        try:
            token = token.split(' ')[1]
            payload = jwt.decode(token, SECRET_KEY, algorithms=['HS256'])
            request.user = payload
        except jwt.PyJWTError:
            return jsonify({'error': 'Invalid token'}), 403
        
        return f(*args, **kwargs)
    return decorated

@app.route('/api/transcribe', methods=['POST'])
@verify_token
def transcribe():
    try:
        audio_file = request.files['audio']
        # Your transcription logic here
        transcription = transcribe_audio_file(audio_file)
        
        return jsonify({
            'transcription': transcription,
            'success': True
        })
    except Exception as e:
        return jsonify({
            'error': str(e),
            'success': False
        }), 500

@app.route('/api/health', methods=['GET'])
@verify_token
def health():
    return jsonify({'status': 'healthy'})

PHP (Laravel)

<?php

// routes/api.php
Route::middleware('auth:sanctum')->group(function () {
    Route::post('/transcribe', [TranscriptionController::class, 'transcribe']);
    Route::get('/health', [HealthController::class, 'check']);
});

// app/Http/Controllers/TranscriptionController.php
class TranscriptionController extends Controller
{
    public function transcribe(Request $request)
    {
        try {
            $audioFile = $request->file('audio');
            
            // Your transcription logic here
            $transcription = $this->transcribeAudio($audioFile);
            
            return response()->json([
                'transcription' => $transcription,
                'success' => true
            ]);
        } catch (Exception $e) {
            return response()->json([
                'error' => $e->getMessage(),
                'success' => false
            ], 500);
        }
    }
}

// app/Http/Controllers/HealthController.php
class HealthController extends Controller
{
    public function check()
    {
        return response()->json(['status' => 'healthy']);
    }
}

Testing Your API

You can test your API endpoints using curl or any HTTP client:

Test Health Endpoint (with token)

curl -X GET "https://your-api.com/api/health" \
  -H "Authorization: Bearer YOUR_TOKEN"

Test Health Endpoint (without token)

curl -X GET "https://your-api.com/api/health"

Test Transcription Endpoint (with token)

curl -X POST "https://your-api.com/api/transcribe" \
  -H "Authorization: Bearer YOUR_TOKEN" \
  -F "audio=@path/to/audio.webm"

Test Transcription Endpoint (without token)

curl -X POST "https://your-api.com/api/transcribe" \
  -F "audio=@path/to/audio.webm"

Test with JavaScript (for debugging)

With Authentication:

// Test health endpoint with token
fetch('https://your-api.com/api/health', {
  method: 'GET',
  headers: {
    'Authorization': 'Bearer YOUR_TOKEN'
  }
})
.then(response => response.json())
.then(data => console.log('Health check:', data));

// Test transcription endpoint with token
const formData = new FormData();
formData.append('audio', audioBlob); // Your WebM audio blob

fetch('https://your-api.com/api/transcribe', {
  method: 'POST',
  headers: {
    'Authorization': 'Bearer YOUR_TOKEN'
  },
  body: formData
})
.then(response => response.json())
.then(data => console.log('Transcription:', data));

Without Authentication:

// Test health endpoint without token
fetch('https://your-api.com/api/health', {
  method: 'GET'
})
.then(response => response.json())
.then(data => console.log('Health check:', data));

// Test transcription endpoint without token
const formData = new FormData();
formData.append('audio', audioBlob); // Your WebM audio blob

fetch('https://your-api.com/api/transcribe', {
  method: 'POST',
  body: formData
})
.then(response => response.json())
.then(data => console.log('Transcription:', data));

Common Issues and Solutions

CORS Errors

If you encounter CORS errors, make sure your server includes the proper headers:

Access-Control-Allow-Origin: https://yourdomain.com
Access-Control-Allow-Methods: GET, POST, OPTIONS
Access-Control-Allow-Headers: Content-Type, Authorization

Authentication Errors

Ensure your token is valid and not expired
Check that the Authorization header format is correct: Bearer YOUR_TOKEN
Verify your token validation logic

Audio Format Issues

The component sends WebM audio blobs
Make sure your transcription service supports WebM format
Consider converting to other formats if needed (WAV, MP3, etc.)

File Size Limits

WebM files can be large for long recordings
Consider implementing file size limits on your server
Add progress indicators for large file uploads

Browser Support

API Mode

Modern Browsers: Chrome, Firefox, Safari, Edge (latest versions)
Mobile: iOS Safari, Chrome Mobile, Samsung Internet
Requirements: WebRTC support, MediaRecorder API

Speech API Mode

Chrome: Full support (desktop and mobile)
Edge: Full support (desktop and mobile)
Safari: Limited support (desktop only, requires user gesture)
Firefox: Not supported
Mobile: iOS Safari (limited), Chrome Mobile (full support)
Requirements: Speech Recognition API support

Audio Formats

Recording: WebM with Opus codec (high quality, good compression)
Supported Input: Any audio format supported by MediaRecorder API
Output: WebM audio blob sent to your transcription service

Styling

The component includes a powerful state-based color system that allows you to customize colors for different states. You can customize the appearance in several ways:

1. Using Flexible Color System (Recommended)

// Complete color customization with CSS properties
<KonnektaroAudioRecorder 
  colors={{
    idle: { background: "#3b82f6", icon: "#ffffff" },        // Blue when ready
    active: { background: "#ef4444", icon: "#ffffff", border: "2px solid #dc2626" }, // Red with border
    disabled: { background: "#9ca3af", icon: "#ffffff" },    // Gray when disabled
    transcribing: { background: "#f59e0b", icon: "#ffffff" }, // Amber when processing
    ripple: "#ef4444"                                        // Red ripple effect
  }}
  onTranscriptionComplete={handleTranscription}
  onError={handleError}
/>

// Minimal customization (only override what you need)
<KonnektaroAudioRecorder 
  colors={{
    active: { background: "#10b981" },                       // Just change active background
    ripple: "#34d399"                                        // And ripple color
  }}
  onTranscriptionComplete={handleTranscription}
  onError={handleError}
/>

// Global overrides (apply to all states)
<KonnektaroAudioRecorder 
  colors={{
    global: { icon: "#000000" },                             // Black icon for all states
    active: { background: "#ef4444", boxShadow: "0 0 20px rgba(239, 68, 68, 0.5)" }
  }}
  onTranscriptionComplete={handleTranscription}
  onError={handleError}
/>

// Advanced CSS properties
<KonnektaroAudioRecorder 
  colors={{
    active: { 
      background: "linear-gradient(45deg, #ff6b6b, #ee5a24)",
      border: "3px solid #ff4757",
      boxShadow: "0 8px 32px rgba(255, 71, 87, 0.3)",
      transform: "scale(1.05)"
    }
  }}
  onTranscriptionComplete={handleTranscription}
  onError={handleError}
/>

2. Default Styling

If no colors are provided, the component uses a beautiful purple theme:

<KonnektaroAudioRecorder 
  onTranscriptionComplete={handleTranscription}
  onError={handleError}
/>
// Uses default purple colors for all states

3. CSS Customization

You can also override styles using CSS:

/* Custom styles for the microphone button */
.konnektaro-audio-recorder button {
  box-shadow: 0 4px 20px rgba(0, 0, 0, 0.15);
  transition: all 0.3s ease;
}

/* Custom ripple animation */
@keyframes custom-ripple {
  0% { transform: scale(0.8); opacity: 0.8; }
  100% { transform: scale(1.5); opacity: 0; }
}

4. Color Theme Examples

Here are some popular color combinations using the flexible system:

// Professional Blue Theme
<KonnektaroAudioRecorder 
  colors={{
    idle: { background: "#3b82f6", icon: "#ffffff" },
    active: { background: "#2563eb", icon: "#ffffff", boxShadow: "0 0 20px rgba(37, 99, 235, 0.3)" },
    disabled: { background: "#9ca3af", icon: "#ffffff" },
    transcribing: { background: "#1d4ed8", icon: "#ffffff" },
    ripple: "#60a5fa"
  }}
/>

// Success Green Theme with Border
<KonnektaroAudioRecorder 
  colors={{
    idle: { background: "#10b981", icon: "#ffffff" },
    active: { background: "#059669", icon: "#ffffff", border: "2px solid #047857" },
    disabled: { background: "#9ca3af", icon: "#ffffff" },
    transcribing: { background: "#047857", icon: "#ffffff" },
    ripple: "#34d399"
  }}
/>

// Gradient Theme
<KonnektaroAudioRecorder 
  colors={{
    idle: { background: "linear-gradient(135deg, #667eea 0%, #764ba2 100%)", icon: "#ffffff" },
    active: { background: "linear-gradient(135deg, #f093fb 0%, #f5576c 100%)", icon: "#ffffff" },
    disabled: { background: "#9ca3af", icon: "#ffffff" },
    transcribing: { background: "linear-gradient(135deg, #4facfe 0%, #00f2fe 100%)", icon: "#ffffff" },
    ripple: "#f5576c"
  }}
/>

// Dark Theme with Glow Effect
<KonnektaroAudioRecorder 
  colors={{
    idle: { background: "#4b5563", icon: "#ffffff" },
    active: { background: "#374151", icon: "#ffffff", boxShadow: "0 0 30px rgba(55, 65, 81, 0.5)" },
    disabled: { background: "#6b7280", icon: "#ffffff" },
    transcribing: { background: "#1f2937", icon: "#ffffff" },
    ripple: "#9ca3af"
  }}
/>

// Minimalist Theme (only active state different)
<KonnektaroAudioRecorder 
  colors={{
    active: { background: "#ef4444", icon: "#ffffff" },
    ripple: "#fca5a5"
  }}
/>

// Global Icon Override
<KonnektaroAudioRecorder 
  colors={{
    global: { icon: "#000000" }, // Black icon for all states
    active: { background: "#ef4444" },
    ripple: "#fca5a5"
  }}
/>

// Advanced Custom Styling
<KonnektaroAudioRecorder 
  colors={{
    active: { 
      background: "radial-gradient(circle, #ff6b6b, #ee5a24)",
      border: "3px solid #ff4757",
      boxShadow: "0 8px 32px rgba(255, 71, 87, 0.4), inset 0 1px 0 rgba(255, 255, 255, 0.2)",
      transform: "scale(1.05)",
      borderRadius: "50%"
    },
    ripple: "#ff4757"
  }}
/>

Error Handling

The component handles various error scenarios:

Configuration Errors: Missing API URL or token
Permission Errors: Microphone access denied
Network Errors: API connection failures
Transcription Errors: API response errors

All errors are passed to the onError callback and displayed in the UI.

Development

Building the Package

npm run build

Development Mode

npm run dev

Type Checking

npm run type-check

Linting

npm run lint

Security Considerations

Tokens are handled securely in memory only
No audio data is stored locally
All API communication should use HTTPS in production
CORS headers should be configured on your API server

Contributing

Fork the repository
Create a feature branch
Make your changes
Test thoroughly on multiple browsers/devices
Submit a pull request

License

MIT License - see LICENSE file for details.

Changelog

1.1.5

🎨 Flexible Color System: Complete redesign with support for any CSS property (background, border, boxShadow, transform, etc.)
🔧 Smart Defaults: Only specify the properties you want to change - everything else uses sensible defaults
🌍 Global Overrides: Set common properties (like icon color) for all states with the global option
🎯 Enhanced Flexibility: Support for gradients, borders, shadows, transforms, and any CSS property
📚 Updated Documentation: Comprehensive examples showing advanced styling capabilities
✨ Backward Compatibility: Maintains default purple theme when no colors are specified

1.1.2

✨ Enhanced Styling: Added customizable color props for microphone button and ripple effects
🎨 New Color Props: activeBackgroundColor, disabledBackgroundColor, idleBackgroundColor, iconColor, rippleColor
🌊 Improved Ripple Effect: Smaller, more subtle ripples with smooth fade-out animation
🎯 Better UX: Enhanced visual feedback with customizable state-based colors
📚 Updated Documentation: Comprehensive styling guide with color examples and themes
🔧 Code Cleanup: Removed unused props and improved component structure

1.1.0

Added native browser Speech Recognition API fallback
Centered microphone button in UI
Dual-mode support (API mode + Speech API mode)
Made authentication token optional (recommended for production)
Enhanced browser compatibility
Updated documentation with comprehensive API integration examples
Added support for multiple programming languages (Node.js, Python, PHP)
Improved error handling and user feedback

1.0.x

Initial release
React component for speech-to-text conversion
TypeScript support
Multiple UI variants

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

@konnektaro/speech-to-text

Features

Installation

Quick Start

Method 1: Using Custom API (Recommended for Production)

API Mode without Token (Less Secure)

Method 2: Using Native Browser Speech Recognition (No Setup Required)

API Reference

KonnektaroAudioRecorder Props

ColorConfig Interface

Mode Comparison

Other Exports

Using Exported Functions

API Integration

POST /api/transcribe

GET /api/health (Optional)

Authentication

Option 1: Required Authentication (Recommended for Production)

Option 2: Optional Authentication

Option 3: No Authentication (Development Only)

CORS Configuration

Alternative Implementations

Python (FastAPI)

Python (Flask)

PHP (Laravel)

Testing Your API

Test Health Endpoint (with token)

Test Health Endpoint (without token)

Test Transcription Endpoint (with token)

Test Transcription Endpoint (without token)

Test with JavaScript (for debugging)

Common Issues and Solutions

CORS Errors

Authentication Errors

Audio Format Issues

File Size Limits

Browser Support

API Mode

Speech API Mode

Audio Formats

Styling

1. Using Flexible Color System (Recommended)

2. Default Styling

3. CSS Customization

4. Color Theme Examples

Error Handling

Development

Building the Package

Development Mode

Type Checking

Linting

Security Considerations

Contributing

License

Changelog

1.1.5

1.1.2

1.1.0

1.0.x