I have been working on a weekend project called “jufsisku” for the past few weeks. This project is to build a search engine where you can look up Lojban-English translations using queries in these two languages. You can try out the search here: http://lojban.lilyx.net/jufsisku/ I have shown the demo to a group of Japanese-speaking lojbanist [...]
The paper we submitted to IJCNLP2011 has been accepted, and will be presented soon at the conference which will be held in a few weeks from now. The paper describes the #ANPI_NLP project, a voluntary relief project focusing on text and safety information mining in the wake of The East Japan Earthquake in March, 2011. [...]
My wife and I decided to provide a program named “Intensive Chinese Weekend Stay” in New York. In this program, we invite a learner of the Chinese language for free to our home and provide an intensive learning course. Intensive Chinese Weekend Stay Program (New York City) Part of the reasons why we started this [...]
On Labor day weekend, my wife and I paid a visit to Penn State University, which is located at State College, in the middle of the state of Pennsylvania. It was a four-and-a-half-hour bus ride from New York City, taking Megabus first and Gotobus for the return trip, which was not very comfortable. Our purpose [...]
Just for my convenience, I’ve listed up best papers of major NLP conferences (ACL / COLING / NAACL / EMNLP / CoNLL) for the past 7 years or so. If you find anything wrong or mistaken, please let me know. Thanks~ ACL 2005: David Chiang A hierarchical phrase-based model for statistical machine translation 2006: Rion [...]
The special issue “Unnatural Language Processing” of Journal of Natural Language Processing, for which I’m a leading editorial member, has started its call for paper a few weeks ago. This special issue, subtitled “Processing of Out-of-the-box Language Expressions” is the sequel to the past two events of “Unnatural Language Processing” last year. The topics include [...]
The Japanese morphological analyzer MeCab can also be directly called from Clojure, too, by using its Java binding. I have, however, come across some pitfalls related to JNI in the process, so I’ll describe how I’ve overcome them in the following so that everyone else doesn’t have to stumble over the same issues. The first [...]
We’ve been on a short trip to Toronto over the weekend, visiting my wife’s old friends, one of whom is now spending a week in her hometown. We’ve been to Niagara falls, downtown Toronto (ex-world-tallest CN Tower was amazing, and Cosa Loma castle was fun), had a BBQ at their wonderful house, and even enjoyed [...]
I recently found out that tatoeba.org is a pretty nice resource for collecting parallel text in many languages. The major reason why I love it is that the whole data is downloadable as a dump file, with all the sentences being under the creative commons license (although there are some mistakes in the sentences). Specifically, [...]
Last Wednesday, we held our first meet-up meeting of East Asian Language Learning through Interpretation Methods. The purpose is to brush up your language skills (we target at East Asian languages, namely Chinese, Japanese, and Korean) through interpretation methods. Although this was our first time to even set up a meetup group, it was quite [...]
About the Author
Masato Hagiwara currently works for Rakuten Institute of Technology in New York, as a Senior Scientist. Have worked on search technologies at Google, Microsoft Research, and Baidu in the past. Expert in Natural Language Processing (NLP). Also a lead translator of the O'Reilly book "Natural Language Processing in Python." A native speaker of Japanese. Good command of English and Chinese (Mandarin). For more information, see About Me.Pages
- 100 NLP Papers
- About Me
- iconlang – new ideographic writing system for better visibility and legibility
- iconlang – 視認性・識別性向上のための新しい表意文字体系
- Music
- Music for Language Fans
- NLTK Japanese Corpora – NLTKで使える日本語コーパス
- Python/Romkan ローマ字とひらがなを相互に変換する Python用のライブラリ
- TinySegmenter in Python
- 中国語学習完全ガイド | 1年以内にマスターする中国語
- 巻き舌クリニック – みんなで巻き舌を克服するサイト