
M4t1ss/Multi-System-Hybrid-Translator

 
 

Syntax-based Multi-System Hybrid Translator

This is a hybrid solution for obtaining the best translation from multiple online machine translation (MT) engines.

Included software

Requirements

Supported APIs

  • Google Translate
  • Bing Translator
  • LetsMT

Usage

SyMHyT requires three parameters: the language model, the input sentences, and the grammar file. Run it with the following command:

php SyMHyT.php <language model> <input sentences> <grammar>

For example:

php SyMHyT.php languageModel.binary inputSentences.txt eng_sm6.gr
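Because the command takes three file arguments in a fixed order, it can help to check them before starting a run. The following is a minimal sketch of such a wrapper, not part of the repository: the function names `check_inputs` and `run_symhyt` are hypothetical, and it assumes `php` is on the `PATH` and that it is run from the directory containing `SyMHyT.php`.

```shell
#!/bin/sh
# Hypothetical wrapper (not part of SyMHyT): verify the three inputs, then run.

check_inputs() {
    # Expect exactly three arguments: language model, input sentences, grammar.
    if [ "$#" -ne 3 ]; then
        echo "usage: <language model> <input sentences> <grammar>" >&2
        return 2
    fi
    # Every argument must be a readable file.
    for f in "$@"; do
        if [ ! -r "$f" ]; then
            echo "missing or unreadable file: $f" >&2
            return 1
        fi
    done
    return 0
}

run_symhyt() {
    # Only invoke SyMHyT when all three inputs passed the check.
    check_inputs "$@" && php SyMHyT.php "$1" "$2" "$3"
}

# Example call (commented out so that sourcing this file has no side effects):
# run_symhyt languageModel.binary inputSentences.txt eng_sm6.gr
```

The check only confirms that the files exist and are readable; it does not validate their contents.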

A run generates four output files:

  • output.google.txt
  • output.bing.txt
  • output.letsmt.txt
  • output.hybrid.txt
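After a run, a quick way to confirm that all four files were written is to list them with their line counts. This is a rough sketch only: `report_outputs` is a hypothetical helper, and it assumes the output files are written to the directory passed as its argument (the current directory by default).

```shell
#!/bin/sh
# Hypothetical helper (not part of SyMHyT): print the line count of each
# expected output file and return non-zero if any of the four is missing.
report_outputs() {
    dir=${1:-.}
    rc=0
    for name in google bing letsmt hybrid; do
        file="$dir/output.$name.txt"
        if [ -f "$file" ]; then
            # Reading via stdin makes wc print only the count, not the name.
            printf '%s\t%s\n' "$file" "$(wc -l < "$file" | tr -d ' ')"
        else
            echo "missing: $file" >&2
            rc=1
        fi
    done
    return "$rc"
}

# Example (commented out so that sourcing this file has no side effects):
# report_outputs .
```

Comparing the counts across the four files is a simple sanity check, although this README does not state that the files must have equal length.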

Utils

The utils directory contains separate parts of the SyMHyT system that can be run as standalone tools:

  • utils/chunking/ contains files for individual chunking and unchunking

    • To parse an input file with the Berkeley Parser (the chunker requires a parsed file as input), run:
     java -Xmx1024m -jar BerkeleyParser-1.7.jar -gr grammar.gr < input.txt
    
  • utils/chunks_to_translated_chunks/ contains files for translating chunked files as a separate step

  • utils/translated_chunks_to_hybrid/ contains files for running the hybrid system with chunked translated files
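The Berkeley Parser command shown in the chunking step reads from standard input and, as written, prints to standard output, so the parse presumably needs to be saved to a file before the chunker can use it. A minimal sketch, assuming output redirection is all that is needed: the function name `parse_to_file` and the file name `parsed.txt` are placeholders, and the `java` command itself is the one given above.

```shell
#!/bin/sh
# Hypothetical helper (not part of SyMHyT): run the Berkeley Parser command
# from this README and save its standard output to a file.
parse_to_file() {
    grammar=$1   # grammar file, e.g. eng_sm6.gr
    input=$2     # plain-text input sentences
    output=$3    # destination for the parsed sentences
    java -Xmx1024m -jar BerkeleyParser-1.7.jar -gr "$grammar" < "$input" > "$output"
}

# Example (file names are placeholders):
# parse_to_file eng_sm6.gr input.txt parsed.txt
```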

About

A hybrid machine translation solution that employs a language model and online translation APIs
