Back to Blog
Cloud Computing

Amazon Translate: The Complete Guide to AWS Translation

Amazon Translate provides neural machine translation across 75 languages with 5,550 language pair combinations, custom terminology, Active Custom Translation, and document formatting preservation. This guide covers real-time and batch translation, customization features, pricing, and a comparison with Google Cloud Translation.

Cloud Computing
Service Deep Dive
13 min read
4 views

What Is Amazon Translate?

Inevitably, reaching global audiences means communicating in their language — product descriptions, customer support conversations, legal documents, user interfaces, and marketing content all need localization. Traditionally, this required either expensive human translation agencies or building custom ML translation pipelines. Amazon Translate eliminates both barriers with neural machine translation that works at scale.

Amazon Translate is a fully managed neural machine translation service from Amazon Web Services that delivers fast, high-quality, and affordable language translation. Currently, it supports 75 languages and over 5,550 language pair combinations, handles both real-time and batch translation, and preserves document formatting during translation — making it suitable for everything from real-time customer chat localization to high-volume bulk document processing across global markets.

Importantly, Amazon Translate goes beyond basic word-for-word translation. Specifically, its deep learning models consider the entire context of each source sentence to generate fluent, natural-sounding translations. Furthermore, Custom Terminology ensures that brand names, product names, and industry-specific terms translate consistently, while Active Custom Translation (ACT) lets you customize the translation engine with your own parallel data for domain-specific accuracy improvements — all without training or maintaining custom ML models.

Amazon Translate Capabilities Overview

75 languages
Supported Languages
5,550 pairs
Language Combinations
2M chars
Free Tier (Monthly)

Moreover, Amazon Translate integrates natively with the broader AWS ecosystem — S3 for document storage, Lambda for event-driven translation workflows, Comprehend for text analysis of translated content, and Transcribe for speech-to-text in multilingual pipelines. Consequently, you can build complete multilingual processing workflows entirely within AWS.

Key Takeaway

Amazon Translate provides neural machine translation across 75 languages with custom terminology support, document formatting preservation, and deep AWS integration. If your organization needs to localize content, translate documents, or power multilingual applications at scale, Translate is the fastest path to production-grade translation on AWS.


How Amazon Translate Works

Fundamentally, Essentially, Amazon Translate operates as a serverless API service. Simply send text in a source language (or let Translate auto-detect it), specify the target language, and receive the translated text in return. Under the hood, the service uses neural machine translation models trained on a vast, diverse corpus of multilingual data to understand context and produce fluent translations.

Real-Time Translation with Amazon Translate

For immediate translation needs, the synchronous TranslateText API processes up to 10,000 bytes of text per request and returns results in real time. Consequently, this mode powers use cases like live chat translation, dynamic website localization, and real-time content rendering. Additionally, the Real-Time Document Translation API translates entire documents (TXT, HTML, DOCX) while preserving the original formatting — so translated Word documents maintain their layout, fonts, and structure.

Batch Translation with Amazon Translate

For high-volume translation workloads, the asynchronous TextTranslationJob API processes entire collections of documents stored in S3. Essentially, you submit a batch job specifying source and target languages, input and output S3 locations, and Translate processes all files automatically. Subsequently, results are delivered to the specified output bucket when processing completes. Importantly, batch translation supports Word (DOCX), PowerPoint (PPTX), Excel (XLSX), plain text, and HTML documents — making it ideal for bulk localization of product catalogs, documentation libraries, and marketing content.

Customization Features in Amazon Translate

Currently, Amazon Translate provides two customization mechanisms that significantly improve translation quality for domain-specific content:

  • Custom Terminology: Essentially, upload a glossary of terms that must translate in a specific way — brand names, product names, acronyms, and industry jargon. Subsequently, Translate uses these terms exactly as specified, overriding the neural model’s default translation. Importantly, supports up to 10,000 terms per terminology file at no additional cost beyond the standard per-character translation pricing.
  • Active Custom Translation (ACT): Alternatively, provide parallel data — examples of source-target translation pairs — along with your batch translation job. Subsequently, Translate uses this data to customize the translation output at runtime, improving accuracy for your specific domain without requiring you to build, train, deploy, or maintain any custom translation model. Specifically, particularly valuable for legal, medical, technical, and financial content where standard translations may miss domain-specific nuance.

Core Amazon Translate Features

Beyond the translation APIs and customization options, several capabilities make Amazon Translate particularly versatile and well-suited for enterprise-scale multilingual deployment:

Neural Machine Translation
Deep learning models consider entire sentence context to produce fluent, natural-sounding translations. Significantly more accurate than traditional statistical or rule-based translation approaches.
Document Format Preservation
Translates DOCX, PPTX, XLSX, HTML, and TXT files while preserving original formatting, layout, fonts, and structure. Eliminates the need to reformat documents after translation.
Automatic Language Detection
Automatically detects the source language when not specified, supporting multilingual content pipelines where the input language may vary. Works seamlessly with Amazon Comprehend’s language detection for enhanced accuracy.
Custom Terminology
Ensure consistent translation of brand-specific terms, product names, and industry jargon. Upload glossary files at no additional cost — terms are applied during translation without overriding the neural model for other content.
Active Custom Translation
Provide parallel translation examples to customize batch translation output for your specific domain. Improves accuracy for specialized content (legal, medical, technical) without building custom models.
Profanity Masking
Optionally mask profane words and phrases in translation output, replacing them with a generic string. Useful for content platforms and customer-facing applications where inappropriate language must be filtered.

Need Multilingual Content at Scale?
Our AWS team designs and deploys Translate-powered localization pipelines for global content


Amazon Translate Pricing Model

Fundamentally, Fundamentally, Amazon Translate uses pay-per-character pricing with no minimum commitments. Rather than listing specific dollar amounts that change over time, here is how the cost structure works and what to watch for when estimating monthly spend:

Understanding Amazon Translate Costs

  • Standard translation: Charged per character processed (including whitespace and punctuation). Importantly, the same per-character rate applies to both real-time and batch translation. Pricing is flat regardless of volume — there are no tiered discounts for higher usage.
  • Active Custom Translation: Charged at a higher per-character rate than standard translation, reflecting the runtime customization overhead. Free tier of 500,000 characters per month for the first 2 months only.
  • Custom Terminology: Notably, no additional charge for using Custom Terminology — the cost is included in the standard per-character rate. However, parallel data storage for ACT is charged at a small per-GB monthly rate beyond the 200 GB free allowance.
  • Free tier: 2 million characters per month of standard translation for the first 12 months. Generous enough for development, testing, and moderate production workloads.
Cost Optimization Strategy

Critically, preprocess text before translation to remove unnecessary whitespace, normalize formatting, and strip tags that Translate will ignore — savings of 10-20% are common for heavily formatted documents. Cache translated content to avoid re-translating static text. When the source language is known, specify it explicitly to avoid the additional cost of automatic language detection via Comprehend. For current pricing, see the official Translate pricing page.


Real-World Amazon Translate Use Cases

Given its versatility across 75 languages, Amazon Translate serves organizations with global content and communication needs. Below are the use cases we implement most frequently for our enterprise clients with global content requirements:

E-Commerce Product Localization
Translate product titles, descriptions, reviews, and category pages across target markets. Batch translation processes entire product catalogs while Custom Terminology ensures brand names and product terms translate consistently.
Multilingual Customer Support
Translate customer service chat messages in real time, enabling agents to support customers in any language. Combine with Amazon Transcribe for multilingual voice support and Comprehend for sentiment analysis of translated conversations.
Document Translation at Scale
Translate internal documentation, compliance materials, training manuals, and legal documents across languages while preserving formatting. Batch processing handles thousands of documents per job.
Website and Application Localization
Dynamically translate website content, application interfaces, and user-generated content for international audiences. Real-time API integration enables on-the-fly translation as content is created or requested.
Social Media and Content Analytics
Translate social media posts, reviews, and comments from multiple languages into a single language for unified sentiment analysis via Amazon Comprehend. Enables global brand monitoring across language barriers.
Healthcare and Legal Translation
Translate patient communications, medical documentation, and legal filings with Active Custom Translation for domain-specific accuracy. Custom Terminology ensures medical and legal terms translate correctly and consistently.

Amazon Translate vs Google Cloud Translation

If you are evaluating translation services across cloud providers, here is how Amazon Translate compares directly with Google Cloud Translation API:

Capability Amazon Translate Google Cloud Translation
Language Support Yes — 75 languages, 5,550 pairs ✓ 130+ languages
Neural Translation ✓ Context-aware NMT Yes — Neural Machine Translation
Custom Terminology ✓ Free, up to 10K terms Yes — Glossaries (Advanced API)
Custom Models Yes — Active Custom Translation (ACT) Yes — AutoML Translation
Document Translation ✓ DOCX, PPTX, XLSX, HTML, TXT Yes — PDF, DOCX, PPTX, XLSX
Batch Processing ✓ S3-based batch jobs Yes — Cloud Storage-based batch
Free Tier ✓ 2M chars/month (12 months) ◐ 500K chars/month
Profanity Masking ✓ Built-in ✕ Not available
Ecosystem Integration Yes — S3, Lambda, Comprehend, Transcribe Yes — Cloud Storage, Functions, NL API

Choosing the Right Amazon Translate Alternative

Clearly, both services offer strong neural machine translation. Ultimately, your cloud ecosystem determines the best fit. If you build on AWS, Translate’s native integration with S3, Lambda, Comprehend, and Transcribe makes it the natural choice for building complete multilingual processing pipelines. Conversely, if your infrastructure runs on Google Cloud, Cloud Translation integrates natively with Cloud Storage and Natural Language API.

Notably, Google offers broader language coverage (130+ vs 75 languages) and Google AutoML Translation provides more advanced custom model training. However, Amazon Translate differentiates with a significantly more generous free tier (2M vs 500K characters/month), built-in profanity masking, and the ACT feature that customizes translations at runtime without requiring separate model training. Furthermore, Custom Terminology is included at no additional cost on Amazon Translate, while Google’s equivalent glossary feature requires the Advanced API tier.


Getting Started with Amazon Translate

Fortunately, Amazon Translate requires no setup. You call the API with text and receive translated text immediately. The free tier provides 2 million characters per month for the first 12 months — enough for significant development, testing, and moderate production workloads.

Your First Amazon Translate API Call

Below is a minimal Python example that translates English text to Spanish:

import boto3

# Initialize the Translate client
client = boto3.client('translate', region_name='us-east-1')

# Translate text
response = client.translate_text(
    Text='Welcome to our service. How can we help you today?',
    SourceLanguageCode='en',
    TargetLanguageCode='es'
)

print(f"Translated: {response['TranslatedText']}")
# Output: "Bienvenido a nuestro servicio. ¿Cómo podemos ayudarle hoy?"

Subsequently, for batch document translation, use the start_text_translation_job API to process entire S3 folders of documents. For document translation with formatting preservation, use the translate_document API. For more details, see the Amazon Translate documentation.


Amazon Translate Best Practices and Pitfalls

Advantages
75 languages with 5,550 translation pair combinations
Document translation preserves DOCX, PPTX, XLSX formatting
Custom Terminology included at no additional cost
Active Custom Translation enables domain-specific accuracy
Generous 2M character/month free tier for 12 months
Deep AWS integration with S3, Lambda, Comprehend, Transcribe
Limitations
Fewer languages than Google Cloud Translation (75 vs 130+)
No volume-tiered pricing — flat rate regardless of usage
Whitespace and punctuation count toward billable characters
ACT free tier is only 2 months (vs 12 months for standard)
Translation quality can be literal for some language pairs

Recommendations for Amazon Translate Deployment

  • First, define Custom Terminology before translating: Upload brand names, product names, and industry-specific terms into a terminology file. This single step prevents the most common translation errors — your brand appearing as a generic translated word instead of staying as-is.
  • Additionally, preprocess text to reduce character count: Specifically, remove unnecessary whitespace, strip HTML tags for non-HTML content, and normalize formatting before submitting text for translation. Since every character (including spaces) counts toward billing, preprocessing can reduce costs by 10-20%.
  • Furthermore, cache translated static content: For content that does not change frequently (product descriptions, UI strings, help documentation), translate once and cache the results. Avoid re-translating the same content on every request.
  • Moreover, use ACT for domain-specific quality: For specialized content (legal, medical, technical), provide parallel data through Active Custom Translation. The quality improvement for domain-specific terminology is significant and often eliminates the need for post-editing by human translators.
  • Finally, combine with Comprehend for multilingual analytics: Translate international content into a single language, then analyze it with Comprehend for sentiment, entities, and key phrases. Consequently, this combination enables unified global brand monitoring and customer analytics across language barriers.
Key Takeaway

Amazon Translate breaks language barriers at scale — translating content across 75 languages with neural accuracy, custom terminology preservation, and document formatting retention. The key to maximizing value is preprocessing text to minimize character counts, caching static translations, and using Custom Terminology and ACT for domain-specific quality. An experienced AWS partner can help you design multilingual content pipelines that balance translation quality, cost, and automation.

Ready to Go Global?
Let our AWS team build multilingual content pipelines powered by Amazon Translate


Frequently Asked Questions About Amazon Translate

Common Questions Answered
What is Amazon Translate used for?
Essentially, Amazon Translate is used for converting text from one language to another using neural machine translation. Common use cases include e-commerce product localization, multilingual customer support chat translation, bulk document translation (Word, PowerPoint, Excel, HTML), website and application localization, social media analytics across languages, and healthcare and legal document translation. It supports 75 languages with over 5,550 language pair combinations.
Is Amazon Translate free?
Indeed, Amazon Translate offers a free tier providing 2 million characters per month of standard translation for the first 12 months. Active Custom Translation includes a separate free tier of 500,000 characters per month for the first 2 months only. Beyond the free tier, standard translation uses flat per-character pricing with no volume discounts. Custom Terminology is included at no additional cost.
What is the difference between Amazon Translate and Amazon Transcribe?
Fundamentally, they serve entirely different functions despite their similar names. Amazon Translate converts text from one language to another (e.g., English text → Spanish text). Amazon Transcribe converts speech to text (e.g., audio recording → written transcript). Consequently, Essentially, Translate handles language translation while Transcribe handles speech recognition. They are often used together in multilingual pipelines — Transcribe converts spoken audio to text, then Translate converts that text into the target language.

Technical and Quality Questions

How accurate is Amazon Translate?
Naturally, accuracy varies by language pair and content domain. Generally, for common language pairs (English-Spanish, English-French, English-Portuguese), user reviews report strong accuracy competitive with Google Translate and DeepL. For specialized domains (legal, medical, technical), using Custom Terminology and Active Custom Translation significantly improves accuracy. However, some users note that translations can be too literal for certain language pairs. Therefore, always test with your specific content and language pairs before deploying to production.
Can Amazon Translate preserve document formatting?
Yes. Indeed, Amazon Translate supports real-time document translation that preserves the original formatting of DOCX, HTML, and TXT files. For batch processing, it also handles PPTX and XLSX files while maintaining layout, fonts, and structure. Consequently, this eliminates the manual reformatting work that is typically required after machine translation of formatted documents.
Weekly Briefing
Security insights, delivered Tuesdays.

Join 1 million+ security professionals. Practical, vendor-neutral analysis of threats, tools, and architecture decisions.