Powering the future through the creation of high-quality, large scale, multilingual learning data for AI/machine learning

Contact us

What our clients are saying

"Baobab creates high quality image annotation data sets for us according to various different requirements. Furthermore, since the annotators are individually managed, we also entrust the company with the annotation of highly sensitive data."

"I have asked Baobab to create data for my research many times, and I really appreciate their willingness and flexibility in responding even to slightly unusual requests. I thoroughly recommend them."

Our services

BAOBAB

BAOBAB is a website designed to create translations and linguistic data by cooperating as a group and using machine learning in order to broaden communication and services all over the world. BAOBAB

Learning Data for Machine Translation

Our company began as a service that could offer the extremely large volumes of textual learning data needed for machine translation faster and at a more reasonable price than anywhere else.

Image annotation and tagging/
voice data collection

  • Image annotation and tagging
  • Image captioning
  • Voice transcription
We have created a special in-house tool for image annotation, ensuring speedy and accurate results.

moringa

Using Moringa, a mobile app developed by BAOBAB, staff all around the world can collect and tag images, and collect multilingual speech utterances/sounds.

  • Moringa-i, an image collection and tagging tool
  • Moringa-v, a voice data collection and tagging tool
moringa

Creating bilingual scenarios
in multiple languages

  • Data for dialogue scenarios, read aloud by native speakers.
  • Simulated conversations between 2 speakers, speaking freely on a predetermined setting.
We create transcribed voice data and written transcriptions of the above.
Click here for sample dialogue scenarios

Developing machine translation engines specialised for particular areas

We develop machine translation engines specialised for particular areas, and provide them as an API. We undertake everything from the creation of learning data, to the development of machine translation engines, and human-powered evaluation of the resulting translations.
Example:

  • A machine translation engine that specialises in recipes (Japanese ⇄ English)
Yoko's Yummy Recipes (iOS / Android)


News
2017.04.01

We are pleased to announce the appointment of Dr Graham Neubig from the Carnegie Mellon University Language Technology Institute as our adviser on April 1st, 2017.

2016.12.11

We released image/voice data collection and tagging tool "moringa".

2016.07.10

Sponsoring the 26th International Conference on Computational Linguistics (Coling 2016)

2016.03.07

Sponsored the 22nd Annual Conference of Natural Language Processing (NLP2016)

2015.12.22

Baobab CEO Miori Sagara was appointed as a delegate of the Association for Natural Language Processing

2015.07.27

Sponsored the 53rd Annual Meeting of the Association for Computational Linguistics (ACL2015)

2015.04.01

Began an Image Gathering and Annotation Service

2014.08.23

Sponsored the 25th International Conference on Computational Linguistics (Coling 2014)

2014.07.15

Aided in the 25th International Conference on Computational Linguistics (Coling 2014)

2014.06.09

Release of the updated version of “Yoko’s Yummy Recipes” An iPhone/Android translation application designed specifically for translating recipes

2013.11.27

Worked as a guest lecturer at Meiji University.

2013.03.12

Aided with the 19th Annual Language Processing Academic Convention (NLP2013)