Amazon Translate: What It Is and When to Use It

Q: How do I handle text that is longer than the real-time API limit?

The synchronous `TranslateText` API has a limit of 10,000 bytes per request. For longer texts or documents, you have two primary options: 1) Split the text into smaller segments (like paragraphs) and make multiple `TranslateText` API calls, or 2) Use the asynchronous Batch Translation feature, which is designed for large documents and can process files up to 20 MB each and batches up to 5 GB.

Definition

Amazon Translate is a fully managed neural machine translation service that uses deep learning models to deliver fast, high-quality, and affordable language translation. It enables developers to easily translate large volumes of text and documents between a wide array of supported languages, solving the problem of localizing content and enabling real-time, multilingual communication within applications.

How It Works

Amazon Translate is built on a neural network architecture that considers the entire context of a source sentence to produce more accurate and natural-sounding translations than older statistical or rule-based methods. The core of the service is a simple API that accepts source text, a source language (which can be auto-detected), and a target language, returning the translated text in UTF-8 format.

Architecture & Data Flow:

Request Initiation: A client application (e.g., a web server, an AWS Lambda function) sends a request to the Amazon Translate API endpoint. This can be a real-time request (TranslateText or TranslateDocument) or a batch request (StartTextTranslationJob).
Input Processing:
- For real-time translation, the service accepts a string of text (up to 10,000 bytes) or a single document (TXT, HTML, DOCX).
- For batch translation, the service reads documents (TXT, HTML, DOCX, PPTX, XLSX, XLIFF) from a specified Amazon Simple Storage Service (S3) bucket.
Language Detection (Optional): If the source language is set to auto, Amazon Translate internally calls Amazon Comprehend to accurately identify the source language before proceeding.
Translation & Customization: The neural translation engine processes the text. During this phase, it can apply several optional customizations:
- Custom Terminology: Applies a user-defined glossary (CSV, TMX, or TSV file) to ensure specific brand names, product names, or unique terms are translated consistently.
- Active Custom Translation (ACT): For batch jobs, you can provide a corpus of example translations (called Parallel Data) to influence the style, tone, and word choice of the output, adapting it to a specific domain without training a separate model.
- Formality Control: You can specify Formal or Informal as the desired tone for certain target languages (like Spanish, French, German, etc.).
- Profanity Masking: Can be enabled to replace profane words and phrases with the grawlix string "?$#@$".
Response Generation:
- For real-time requests, the translated text or document is returned synchronously in the API response.
- For batch jobs, the translated documents are written to a designated output S3 bucket, preserving the original file format.

This entire process is fully managed, meaning developers do not need to build, train, or manage any machine learning models themselves.

Key Features and Limits

Translation Modes: Supports both synchronous real-time translation for interactive use cases and asynchronous batch translation for large volumes of documents stored in Amazon S3.
Broad Language Support: Translates text between 75+ languages and variants.
Automatic Language Identification: Can automatically detect the source language when not specified.
Document Translation: Natively supports real-time and batch translation of various document formats (TXT, HTML, DOCX, PPTX, XLSX, XLIFF) while preserving formatting for Office documents.
Customization:
- Active Custom Translation (ACT): Use parallel data to adapt translations to your specific domain's terminology and style.
- Custom Terminology: Ensure brand names, character names, and other unique terms are always translated according to your rules.
- Formality Settings: Control the level of formality (formal or informal) for supported target languages.
- Profanity Masking: Automatically mask profane words and phrases in the translated output.
- Do-Not-Translate Tags: Use HTML tags to prevent specific portions of the content from being translated.
Security: All content processed by Amazon Translate is encrypted at rest and in transit. VPC endpoints are supported for private connections.

Service Limits (Quotas) as of 2026:

Real-Time (Synchronous):
- Maximum input text size: 10,000 bytes per request.
- Maximum document size: 100,000 bytes and 100,000 characters.
Batch (Asynchronous):
- Maximum total batch size: 5 GB.
- Maximum documents per batch: 1,000,000.
- Maximum size per document: 20 MB.
- Maximum characters per document: 1,000,000.
- Maximum concurrent jobs: 10.
Customization:
- Maximum custom terminology files: 100 per account per Region.
- Maximum parallel data resources: See documentation for current limits.

Common Use Cases

Content Localization: Batch translate websites, documentation, and marketing materials stored in S3 to reach a global audience.
Real-Time Communication: Enable multilingual live chat for customer support, allowing agents to communicate with customers in their native language.
Social Media and User-Generated Content Analysis: Translate streams of social media posts, product reviews, or forum comments into a single language for analysis with services like Amazon Comprehend.
E-commerce Globalization: Automatically translate product catalogs, descriptions, and customer reviews to facilitate cross-border sales.
Internal Communications: Translate internal documents, emails, and knowledge base articles to support a global workforce.

Pricing Model

Amazon Translate uses a pay-as-you-go pricing model based on the number of characters processed.

Free Tier: New AWS accounts receive 2 million characters per month free for the first 12 months for standard text translation. This does not apply to document translation.
Standard Translation: Billed per million characters for real-time text and batch translation of text or basic document formats (TXT, HTML). As of early 2026, the standard rate is $15 per million characters.
Document Translation (Office Formats): Translating DOCX, PPTX, or XLSX files with formatting preservation costs more than the standard rate. As of early 2026, this is priced at $30 per million characters.
Active Custom Translation (ACT): Using parallel data for customization is the most expensive tier. As of early 2026, this is priced at $60 per million characters.

There are no upfront costs or minimum fees. Pricing is consistent across most AWS Regions. For detailed and current pricing, always consult the official AWS Pricing Calculator.

Pros and Cons

Pros:

Deep Integration with AWS: Seamlessly connects with other AWS services like S3 for batch jobs, Lambda for event-driven translation, Comprehend for analysis, and Polly for text-to-speech.
Fully Managed: No infrastructure to manage or models to train, allowing developers to focus on application logic.
Scalable and Fast: The service is highly scalable and provides low-latency translations suitable for real-time applications.
Rich Customization: Features like Custom Terminology and Active Custom Translation provide significant control over the translation output without requiring deep ML expertise.

Cons:

Language Coverage: While extensive, it supports fewer languages (75+) than competitors like Google Cloud Translation (100+).
API-Only Interface: Lacks a built-in graphical user interface (GUI) for non-developers, making it primarily a tool for programmatic use.
Cost at Scale: While affordable for many use cases, high-volume translation, especially using ACT or document formatting, can become expensive.
Free Tier Limitation: The free tier expires after 12 months, unlike some competitors that offer a perpetual, albeit smaller, free tier.

Comparison with Alternatives

Amazon Translate vs. Google Cloud Translation API:
- Ecosystem: Translate is the natural choice for workloads already within the AWS ecosystem, while Google's API is native to Google Cloud Platform (GCP).
- Languages: Google generally supports a larger number of languages.
- Pricing (as of 2026): Amazon's standard rate ($15/M characters) is typically cheaper than Google's ($20/M characters). However, free tiers differ; Amazon's is larger but expires, while Google's is smaller but permanent.
Amazon Translate vs. Microsoft Translator Text API:
- Ecosystem: Microsoft's service is integrated into the Azure cloud.
- Pricing (as of 2026): Microsoft is often the most cost-effective option at scale, with a standard rate of $10 per million characters and the most generous permanent free tier (2 million characters/month).
- Features: All three major cloud providers offer a comparable set of core features like customization, document translation, and language detection.
Amazon Translate vs. Amazon Bedrock:
- Purpose: Amazon Translate is a specialized, purpose-built service optimized for high-quality, low-latency translation. Amazon Bedrock is a broader service providing API access to various general-purpose Large Language Models (LLMs) from providers like Anthropic and Meta.
- Use Case: For pure translation tasks, Amazon Translate is more efficient, feature-rich (e.g., Custom Terminology, ACT), and cost-effective. For multi-step, creative, or conversational tasks that might include translation as one step among others (like summarization and translation), an LLM via Bedrock could be a valid, albeit potentially more expensive and less optimized, choice.

Exam Relevance

Amazon Translate is a key service in the Machine Learning category and appears on several AWS certification exams.

AWS Certified Machine Learning - Specialty (MLS-C01): Candidates should understand the core use cases for Translate, how it integrates with other services (S3, Lambda, Comprehend), and particularly how to use its customization features like Custom Terminology and Active Custom Translation (Parallel Data).
AWS Certified Developer - Associate (DVA-C02): Developers should know how to call the Translate API (e.g., TranslateText), understand its basic request/response structure, and how to integrate it into an application, often using an AWS SDK.
AWS Certified Solutions Architect - Associate (SAA-C03): Architects should know when to select Amazon Translate as the appropriate service for localization and multilingual requirements in a solution design. Key considerations include its real-time vs. batch capabilities and its integration patterns with other AWS services.

Frequently Asked Questions

Q: What is the difference between Amazon Translate and Amazon Transcribe?

A: They are distinct services for different purposes. Amazon Translate converts text from a source language to a target language (e.g., English text to Spanish text). Amazon Transcribe converts audio (speech) into text (e.g., an MP3 file into a written transcript). They are often used together in a pipeline: first Transcribe converts foreign-language audio to text, then Translate converts that text into another language.

Q: Does Amazon Translate store my content?

A: Amazon Translate may store and use text inputs to provide, maintain, and improve the service and other Amazon ML/AI technologies. However, AWS provides sophisticated technical and physical controls, including encryption, to prevent unauthorized access. Customers concerned about data privacy can contact AWS Support to request that their data be deleted and not stored for future use.

Q: How do I handle text that is longer than the real-time API limit?

A: The synchronous TranslateText API has a limit of 10,000 bytes per request. For longer texts or documents, you have two primary options: 1) Split the text into smaller segments (like paragraphs) and make multiple TranslateText API calls, or 2) Use the asynchronous Batch Translation feature, which is designed for large documents and can process files up to 20 MB each and batches up to 5 GB.

This article reflects AWS features and pricing as of 2026. AWS services evolve rapidly — always verify against the official AWS documentation before making production decisions.

Amazon Translate: What It Is and When to Use It

Definition

How It Works

Key Features and Limits

Common Use Cases

Pricing Model

Pros and Cons

Comparison with Alternatives

Exam Relevance

Frequently Asked Questions

Q: What is the difference between Amazon Translate and Amazon Transcribe?

Q: Does Amazon Translate store my content?

Q: How do I handle text that is longer than the real-time API limit?

More in Machine Learning

Amazon Comprehend Medical: How It Works & When to Use It

SageMaker Ground Truth: Build ML Datasets Easily

Amazon CodeWhisperer: AI Coding Companion for Productivity

Amazon Augmented AI (A2I): How It Works & Use Cases

Bedrock Guardrails: Secure Your Generative AI Apps