Product Overview
Speech recognition is a service that accurately converts speech to text, giving users more ways to communicate. Combined with natural language processing (NLP), it lets developers derive user intent from speech and use it as a basis for operational strategies.
Product Features
- Multiple languages: Recognizes more than 20 languages, including English, Japanese, Korean, Chinese, Indonesian, Filipino, Thai, Vietnamese, Arabic, Portuguese, Spanish, and Turkish.
- Custom hot words: You can upload a personalized word list so that domain-specific terms and uncommon words are transcribed correctly, improving recognition accuracy for specific words or phrases.
- Noise robustness: Handles noisy audio from a variety of environments without additional noise reduction.
Application Scenarios
- Short audio (≤60 seconds): Accurately transcribes the audio into text and returns results in real time.
- Long audio (<5 hours): Converts the audio into text data, providing a basis for text mining.
- Speaker differentiation (Chinese only): Distinguishes between speakers, providing a basis for customer service quality inspection.
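The duration limits above can be expressed as a small routing check on the client side. The following is a minimal sketch only: `RecognitionMode` and `choose_mode` are hypothetical names made up for illustration and are not part of the service's API; only the 60-second and 5-hour limits come from the list above.

```python
from enum import Enum

# Limits taken from the scenarios listed above.
SHORT_AUDIO_MAX_SECONDS = 60          # short audio: <= 60 seconds
LONG_AUDIO_MAX_SECONDS = 5 * 60 * 60  # long audio: < 5 hours


class RecognitionMode(Enum):
    """Hypothetical labels for the two transcription scenarios."""
    SHORT = "short"  # results returned in real time
    LONG = "long"    # long audio converted to text data


def choose_mode(duration_seconds: float) -> RecognitionMode:
    """Pick a scenario from the audio duration (illustrative helper)."""
    if duration_seconds <= 0:
        raise ValueError("duration must be positive")
    if duration_seconds <= SHORT_AUDIO_MAX_SECONDS:
        return RecognitionMode.SHORT
    if duration_seconds < LONG_AUDIO_MAX_SECONDS:
        return RecognitionMode.LONG
    raise ValueError("audio of 5 hours or more exceeds the long-audio limit")
```

For example, `choose_mode(45)` selects the short-audio scenario, while a 90-minute recording (`choose_mode(5400)`) falls under long audio.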