audio_transcribe

Work in Progress This skill is currently under development and may change significantly.

Type: Shared Skill
Scope: All agents
Location: /skills/audio_transcribe/SKILL.md

Copy This Skill

---
name: audio_transcribe
description: Transcribe audio files into text for processing
---

# Audio Transcribe

Transcribes audio files (voice messages, recordings) into text for processing by the agent.

## Usage

Invoke with: /audio_transcribe [audio_url_or_attachment]

## Examples

- /audio_transcribe https://example.com/meeting-recording.mp3
- /audio_transcribe [attached voice message]

## Supported Formats

- MP3
- WAV
- OGG
- M4A
- WebM

## Output

The agent will return:

## Transcription

[Full transcribed text of the audio]

---

Duration: 3:42
Confidence: 94%

## Options

Additional processing flags:
- --summarize - Provide a summary of the transcription
- --extract-action-items - Extract action items from the audio
- --speaker-identification - Identify different speakers

## Notes

- Maximum file size: 50MB
- For longer recordings, the transcription may take several minutes
- Quality depends on audio clarity and background noise
- Speaker identification works best with distinct voices

📋 Click to view SKILL.md content

---
name: audio_transcribe
description: Transcribe audio files into text for processing
---

# Audio Transcribe

Transcribes audio files (voice messages, recordings) into text for processing by the agent.

## Usage

Invoke with: /audio_transcribe [audio_url_or_attachment]

## Examples

- /audio_transcribe https://example.com/meeting-recording.mp3
- /audio_transcribe [attached voice message]

## Supported Formats

- MP3
- WAV
- OGG
- M4A
- WebM

## Output

The agent will return:

## Transcription

[Full transcribed text of the audio]

---

Duration: 3:42 Confidence: 94%

## Options

Additional processing flags:

- --summarize - Provide a summary of the transcription
- --extract-action-items - Extract action items from the audio
- --speaker-identification - Identify different speakers

## Notes

- Maximum file size: 50MB
- For longer recordings, the transcription may take several minutes
- Quality depends on audio clarity and background noise
- Speaker identification works best with distinct voices

Description

Transcribes audio files (voice messages, recordings) into text for processing by the agent.

Usage

/audio_transcribe [audio_url_or_attachment]

Examples

/audio_transcribe https://example.com/meeting-recording.mp3
/audio_transcribe [attached voice message]

Supported Formats

MP3
WAV
OGG
M4A
WebM

Output

The agent will return:

## Transcription

[Full transcribed text of the audio]

---

**Duration**: 3:42  
**Confidence**: 94%

Options

You can request additional processing:

/audio_transcribe [audio] --summarize
/audio_transcribe [audio] --extract-action-items
/audio_transcribe [audio] --speaker-identification

Notes

Maximum file size: 50MB
For longer recordings, the transcription may take several minutes
Quality depends on audio clarity and background noise
Speaker identification works best with distinct voices

Copy This Skill​

Description​

Usage​

Examples​

Supported Formats​

Output​

Options​

Notes​