What our clients are saying
"Baobab creates high quality image annotation data sets for us according to various different requirements. Furthermore, since the annotators are individually managed, we also entrust the company with the annotation of highly sensitive data."
"I have asked Baobab to create data for my research many times, and I really appreciate their willingness and flexibility in responding even to slightly unusual requests. I thoroughly recommend them."
BAOBAB is a website designed to create translations and linguistic data by cooperating as a group and using machine learning in order to broaden communication and services all over the world. BAOBAB
Learning Data for Machine Translation
Our company began as a service that could offer the extremely large volumes of textual learning data needed for machine translation faster and at a more reasonable price than anywhere else.
Image annotation and tagging/
voice data collection
- Image annotation and tagging
- Image captioning
- Voice transcription
Using Moringa, a mobile app developed by BAOBAB, staff all around the world can collect and tag images, and collect multilingual speech utterances/sounds.
- Moringa-i, an image collection and tagging tool
- Moringa-v, a voice data collection and tagging tool
Creating bilingual scenarios
in multiple languages
- Data for dialogue scenarios, read aloud by native speakers.
- Simulated conversations between 2 speakers, speaking freely on a predetermined setting.
Click here for sample dialogue scenarios
Developing machine translation engines specialised for particular areas
We develop machine translation engines specialised for particular areas, and provide them as an API. We undertake everything from the creation of learning data, to the development of machine translation engines, and human-powered evaluation of the resulting translations.
- A machine translation engine that specialises in recipes (Japanese ⇄ English)
We are pleased to announce the appointment of Dr Graham Neubig from the Carnegie Mellon University Language Technology Institute as our adviser on April 1st, 2017.
We released image/voice data collection and tagging tool "moringa".
Sponsoring the 26th International Conference on Computational Linguistics (Coling 2016)
Sponsored the 22nd Annual Conference of Natural Language Processing (NLP2016)
Baobab CEO Miori Sagara was appointed as a delegate of the Association for Natural Language Processing
Sponsored the 53rd Annual Meeting of the Association for Computational Linguistics (ACL2015)
Began an Image Gathering and Annotation Service
Sponsored the 25th International Conference on Computational Linguistics (Coling 2014)
Aided in the 25th International Conference on Computational Linguistics (Coling 2014)
Release of the updated version of “Yoko’s Yummy Recipes” An iPhone/Android translation application designed specifically for translating recipes
Worked as a guest lecturer at Meiji University.
Aided with the 19th Annual Language Processing Academic Convention (NLP2013)