jdorama.com Forum Index
Programs that convert Kanji+Kana to Romaji
100C



Joined: 09 Dec 2003
Posts: 34
Location: Japan
Country: Japan

Posted: Sun Apr 18, 2004 3:25 pm    Post subject: Programs that convert Kanji+Kana to Romaji

You probably want to translate English into Japanese with a machine translation service or translation software. However, the output is a sentence written in kanji and kana, so to find out how it is pronounced you have to look words up in a dictionary again and again.

But I found application programs that automate this.
One of them is a shareware browser. The other is free software that requires a little knowledge of DOS.


1. Browser
HIRAGANA Navi.V3 http://kids.knowledgewing.com/

It was originally developed for schoolchildren who are learning kanji, and it can convert kanji into kana. It can also convert text into the Roman alphabet.
It is easy to operate, but you have to pay 4,000 yen to keep using it after the 20-day trial period.


2. Free tool
KAKASI - Kanji Kana Simple Inverter http://kakasi.namazu.org/index.html.en

KAKASI is a language processing filter that converts kanji characters to hiragana, katakana, or romaji, and may be helpful for reading Japanese documents.

You run it at the command prompt (DOS window).
For example:
c:\kakasi\bin>kakasi -s -U -Ja -Ha -Ka < kanji.txt > romaji.txt

Input (kanji.txt):
KAKASIは漢字かな混じり文をひらがなやローマ字に変換することを目的として

Output(romaji.txt):
KAKASI ha KANJI kana MAJI ri BUN wohiraganaya ro^ma JI ni HENKAN surukotowo MOKUTEKI toshite
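For anyone who would rather script this than type it each time, here is a minimal Python sketch that wraps the same command. It assumes a kakasi executable is on the PATH and that the input bytes are already in an encoding kakasi accepts. The function names build_kakasi_command and to_romaji are made up for illustration, and the flag descriptions in the comments are my reading of the options and of the sample output above, so check them against the tool's own help.

```python
import shutil
import subprocess


def build_kakasi_command(exe="kakasi"):
    """Return the argument list for the command line shown above."""
    # Assumed meaning of the flags (verify with the tool's help output):
    #   -s   put spaces between the converted words
    #   -U   upper-case the romaji that comes from kanji
    #   -Ja  kanji    -> ASCII (romaji)
    #   -Ha  hiragana -> ASCII (romaji)
    #   -Ka  katakana -> ASCII (romaji)
    return [exe, "-s", "-U", "-Ja", "-Ha", "-Ka"]


def to_romaji(encoded_text, exe="kakasi"):
    """Pipe already-encoded bytes through kakasi and return its raw output."""
    if shutil.which(exe) is None:
        raise FileNotFoundError(f"{exe} was not found on PATH")
    result = subprocess.run(
        build_kakasi_command(exe),
        input=encoded_text,       # same role as "< kanji.txt"
        stdout=subprocess.PIPE,   # same role as "> romaji.txt"
        check=True,
    )
    return result.stdout
```

Calling to_romaji(open("kanji.txt", "rb").read()) would then do the same job as the redirected command, without the intermediate romaji.txt file.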
Wyckd



Joined: 12 Apr 2004
Posts: 41
Location: United States of Los Angeles
Country: United States

Posted: Mon Apr 19, 2004 3:53 pm

oh this is great. thanks!
Azumi



Joined: 15 Apr 2004
Posts: 122


Posted: Sat May 01, 2004 10:40 pm

Thanks! I knew about Kakasi, but I never managed to make it work with all those parameters. Now I can. :)
For those looking for a more direct link to download Hiragana Navi: http://kids.knowledgewing.com/share/share1.html
100C



Joined: 09 Dec 2003
Posts: 34
Location: Japan
Country: Japan

Posted: Sun May 02, 2004 2:24 pm

u r welcome.
There is also a web version of it (it can convert text and web pages).

http://www.j-talk.com/nihongo/

・Kanji to spaced Hiragana
・Readings in brackets
・Kanji to Romaji
・Detailed word info (not websites)
・Kanji with rollover Hiragana, and translation in pop up!
Azumi



Joined: 15 Apr 2004
Posts: 122


Posted: Mon May 03, 2004 1:18 am    Post subject: more software

I also have some other software similar to kakasi (command-line and free, with source code). I never managed to make these work very well on anything other than a text file.
Juman 4.0
ComCha
Chasen
Sie_Kensou



Joined: 20 Jan 2004
Posts: 95


Posted: Fri Jun 18, 2004 1:55 am

i know kanji have different readings: an individual character is read differently when it is combined with other characters to form a word, and the reading also changes with context. can these programs work out the right reading?
Azumi



Joined: 15 Apr 2004
Posts: 122


Posted: Sun Jun 20, 2004 2:29 am

They can if you use the "all" parameter. But you still need to choose manually the reading you want to keep. :( It's a lot of work if you are not fluent in Japanese.
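To illustrate why the reading depends on the whole word and not on a single character, here is a toy Python sketch of dictionary lookup with greedy longest match. The three-entry dictionary and the romanize function are hypothetical and exist only for this example; this is not KAKASI's actual implementation or dictionary.

```python
# Toy dictionary: the same kanji 日 is read differently depending on the word.
READINGS = {
    "日曜日": "nichiyoubi",  # 日 is read "nichi" and then "bi"
    "三日": "mikka",         # here 日 is read "ka"
    "日": "hi",              # on its own
}


def romanize(text, readings=READINGS):
    """Greedy longest-match lookup; unknown characters pass through as-is."""
    out = []
    i = 0
    max_len = max(len(word) for word in readings)
    while i < len(text):
        # Try the longest possible dictionary entry first.
        for length in range(min(max_len, len(text) - i), 0, -1):
            chunk = text[i:i + length]
            if chunk in readings:
                out.append(readings[chunk])
                i += length
                break
        else:
            # No entry starts here: keep the character unchanged.
            out.append(text[i])
            i += 1
    return " ".join(out)
```

With only single-character entries the program could not tell these readings apart, which is why a converter needs a word dictionary and why a human still has to pick when several entries fit.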
farfie



Joined: 27 Nov 2004
Posts: 9


Posted: Sat Nov 27, 2004 4:23 pm    Post subject: Question

How do I set up Kakasi? When I download the package there isn't any sort of a setup or installation program, and I don't know what to do...
Azumi



Joined: 15 Apr 2004
Posts: 122


Posted: Sat Dec 04, 2004 5:10 am

There is no installer. Just extract it to C:\Kakasi\ , which is the default path. To run it, launch the application in c:\kakasi\bin\ from a command window.
You can also create a text file in c:\kakasi\bin, copy this text into it:
kakasi -s -U -Ja -Ha -Ka < kanji.txt > romaji.txt
and save the file as a .bat instead of .txt.
Then you just need to double-click the .bat file instead of typing the long command line every time. :)

I'm not sure if I'm clear enough...

Good Luck!
sunilkkn



Joined: 07 Dec 2004
Posts: 6
Location: tvm
Country: India

Posted: Tue Dec 07, 2004 7:01 pm    Post subject: Programs that convert Kanji+Kana to Romaji

I have visited many sites looking for the Kakasi exe, but I couldn't find it. Also, when I extracted the source archive, no bin folder was created. Can you send me the bin folder so that I can try Kakasi? I also could not build the exe from the source files; it does not compile successfully.

Also, can you tell me how to configure kakasi so that I can run it from the extracted source?

I heard that Kakasi itself tokenizes the kanji before converting it to romaji. Can you explain that?

Are there any APIs available for Kakasi? I also need the Kakasi exe very urgently. Can you send the exe or tell me where to get it? I am using Windows, and Windows binaries are not available at http://kakasi.namazu.org/

Also, are there any programs for converting hiragana and katakana to romaji?

Thanks in Advance

Sunil
groink



Joined: 01 Jan 1970
Posts: 1223


Posted: Wed Dec 08, 2004 3:12 am    Post subject: Re: Programs that convert Kanji+Kana to Romaji

sunilkkn wrote:
I have visited many sites looking for the Kakasi exe, but I couldn't find it. Also, when I extracted the source archive, no bin folder was created. Can you send me the bin folder so that I can try Kakasi? I also could not build the exe from the source files; it does not compile successfully.

Dude, you've got to put a little more effort into it...

http://www.namazu.org/win32/kakasi-2.3.4.zip

As for an API... There is a perl API being designed here:

http://search.cpan.org/dist/Text-Kakasi/

You can then use the perl API as a CGI, then use a web page as the user interface.

I'm going a similar route... I'm writing a BBcode MOD for phpBB (JDorama.com and d-addicts.com use phpBB) that utilizes kakasi. Anyone can then use it in a post. It'll look something like this in the phpBB editor:

Code:
This is a test:

[kakasi]KAKASI は、漢字かなまじり文をひらがな文やローマ字文に変換することを目的として作成したプログラムと辞書の総称です。[/kakasi]



--- groink
sunilkkn



Joined: 07 Dec 2004
Posts: 6
Location: tvm
Country: India

Posted: Wed Dec 08, 2004 3:04 pm    Post subject: Programs that convert Kanji+Kana to Romaji

Thanx groink. I got the kakasi executable and it is running. The zip also includes the libraries.

Kakasi is fine. I hope I can use it for my requirements, which are to convert kanji and kana to romaji and to tokenize Japanese strings.

When I visited http://kakasi.namazu.org/, I got the source code of kakasi in C. In that source code I found many main functions.

To run Kakasi, do we need only kakasi.c and kakasi.h? I am asking about C because I don't know Perl. Do you know how to build the kakasi executable from the C source?

Thanx for the executable of kakasi.

Sunil
sunilkkn



Joined: 07 Dec 2004
Posts: 6
Location: tvm
Country: India

Posted: Wed Dec 08, 2004 6:16 pm    Post subject: Programs that convert Kanji+Kana to Romaji

Hi,

I ran the kakasi executable as follows:

kakasi -s -U -Ja -Ha -Ka < kanji.txt > romaji.txt

and kanji.txt contains 日曜日.

Its romaji should be nichiyobi, but the text file romaji.txt contains the data

RU SUI—yu�eJAKU.

For every conversion, romaji.txt starts with RU SUI. Can anybody explain why this happens?

What I actually need is to convert kanji and kana characters to romaji. Could you pls tell me which options I need to pass to kakasi.exe for this?

Sorry guys, I am not a Japanese language expert, but this is for my project, in which I need to convert Japanese text (in kanji, hiragana, and katakana) to romaji. The first time I ran kakasi I got some output in romaji.txt and thought it was correct, but my colleague, who knows Japanese, told me that it was wrong. This is only a sample text that I tested.

So please help me

Thanks

Sunil
100C



Joined: 09 Dec 2003
Posts: 34
Location: Japan
Country: Japan

Posted: Thu Dec 09, 2004 12:23 am

I am not familiar with programming languages. I just used Google and found Kakasi, so I cannot explain these things in detail.

Quote:
U SUI—yu�eJAKU�.


You may have used the kanji code "UTF-8".
kakasi only supports old JIS, new JIS, EUC, DEC, and Shift-JIS.
It does not support Unicode (UTF-8, UTF-8N, UTF-16, etc.).
So you have to save the input text file in one of these supported kanji codes.
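If the text has already been saved as UTF-8, one way to get a file in a supported code is to re-encode it. Here is a minimal Python sketch, assuming the source really is UTF-8 (with or without a byte-order mark) and that Shift-JIS is the target. The function names and the file names in the usage note are only illustrative.

```python
from pathlib import Path


def utf8_to_sjis_bytes(data):
    """Re-encode UTF-8 bytes as Shift-JIS bytes."""
    # "utf-8-sig" also removes the byte-order mark that some editors
    # write at the very start of a UTF-8 file.
    text = data.decode("utf-8-sig")
    # A character with no Shift-JIS equivalent raises UnicodeEncodeError
    # here instead of being dropped silently.
    return text.encode("shift_jis")


def convert_file(src, dst):
    """Read a UTF-8 file and write a Shift-JIS copy of it."""
    Path(dst).write_bytes(utf8_to_sjis_bytes(Path(src).read_bytes()))
```

For example, convert_file("kanji_utf8.txt", "kanji.txt") would produce a kanji.txt that can be passed to the command line used earlier in this thread.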


and

Microsoft Agent http://www.microsoft.com/msagent/downloads/user.asp and third-party read-aloud software.
Your PC can speak Japanese (a Japanese engine is needed).
groink



Joined: 01 Jan 1970
Posts: 1223


Posted: Thu Dec 09, 2004 3:40 am

100C wrote:
You may have used the kanji code "UTF-8".
kakasi only supports old JIS, new JIS, EUC, DEC, and Shift-JIS.
It does not support Unicode (UTF-8, UTF-8N, UTF-16, etc.).
So you have to save the input text file in one of these supported kanji codes.

That's exactly correct. Some text editors, especially notepad.exe, save as UTF-8.

Also, if you download web pages to text files, make sure that the encoding used to render these pages is Shift-JIS or similar. For Internet Explorer users, you can force this encoding on a particular page by selecting View -> Encoding -> Japanese (Shift-JIS). Then, once the page is re-rendered, you can save the page as a text file.
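A quick way to check a saved file before feeding it to kakasi is to test whether its bytes look like UTF-8. The sketch below is a heuristic in Python, not a real encoding detector: the function name looks_like_utf8 is made up, and a short Shift-JIS file could in rare cases also happen to be valid UTF-8.

```python
import codecs


def looks_like_utf8(data):
    """Heuristic: True if the bytes carry a UTF-8 BOM, or decode cleanly
    as UTF-8 and contain at least one non-ASCII character."""
    if data.startswith(codecs.BOM_UTF8):
        return True
    try:
        text = data.decode("utf-8")
    except UnicodeDecodeError:
        # Typical Shift-JIS or EUC text fails here.
        return False
    # Pure ASCII is the same in every encoding discussed in this thread,
    # so it is not reported as UTF-8.
    return not text.isascii()
```

If looks_like_utf8(open("kanji.txt", "rb").read()) returns True, the file most likely needs to be saved again in Shift-JIS or another supported code first.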

--- groink
Azumi



Joined: 15 Apr 2004
Posts: 122


Posted: Thu Dec 09, 2004 10:47 am

You can get a free Japanese word processor (JWPce) at http://www.physics.ucla.edu/~grosenth/c_download.html
I typed 日曜日, saved it as Shift-JIS, and kakasi converted it to: NICHI YOBI.
So, I guess it worked :)
Typing kakasi -h lists all the options.
sunilkkn



Joined: 07 Dec 2004
Posts: 6
Location: tvm
Country: India

Posted: Thu Dec 09, 2004 10:09 pm    Post subject: Tokenize japanese text

Yes, thanx for your comments. That is exactly what I had done: I saved kanji.txt in UTF-8 format, which is why I got the wrong answer. Now I have tested all the kanji texts with the kakasi front end and got correct answers. Does anybody know how to make the API (C and header files) runnable?

For my project, I need to tokenize Japanese text that contains kanji and kana. Is there any method available for that? I have searched for a tokenizer but failed to find a good one. Can anybody help me?

Once again thanx for your help...

Sunil
groink



Joined: 01 Jan 1970
Posts: 1223


Posted: Fri Dec 10, 2004 4:14 am    Post subject: Re: Tokenize japanese text

sunilkkn wrote:
Yes, thanx for your comments. That is exactly what I had done: I saved kanji.txt in UTF-8 format, which is why I got the wrong answer. Now I have tested all the kanji texts with the kakasi front end and got correct answers. Does anybody know how to make the API (C and header files) runnable?

I'd first off study the perl interface I posted earlier. See how that interface communicates with kakasi, then possibly reverse engineer the code and re-work it in C++. My C++ is pretty weak seeing I haven't worked with that language in eons, so I can't help you there.

--- groink
sunilkkn



Joined: 07 Dec 2004
Posts: 6
Location: tvm
Country: India

Posted: Tue Dec 14, 2004 2:27 pm    Post subject: Tokenize japanese text

Hi,

I got Juman 4.0 for tokenizing Japanese text, both the executable and the source code. But the documentation on how to run it is not very understandable. When I run the application, the following output appears.

C:\Program Files\juman>juman -b -e -r kanji.jumanrc
��╗����&����&&��&&��&&��&&��&₧
is not list. in <car>

parse error between line 819 and 819.
exit(5)

and kanji.jumanrc contains the following text

脳動脈の閉塞

I can't figure out why this happens.

Pls help me. Does anybody know how to run juman.exe?

Thanks & Regards

Sunil
100C



Joined: 09 Dec 2003
Posts: 34
Location: Japan
Country: Japan

Posted: Wed Dec 15, 2004 1:26 am

I do not really understand what the word "tokenizer" means. However, I can read Japanese.
(English is hard for me to read.)

Juman reads from standard input and writes to standard output.
Of the kanji codes, it understands only SJIS.
So when I ran it as follows, it worked normally.

Code:
C:\Program Files\juman>juman -b
脳動脈の閉塞
脳          (のう)       脳         普通名詞
動脈        (どうみゃく) 動脈       普通名詞
の          (の)         の         接続助詞
閉塞        (へいそく)   閉塞       サ変名詞
EOS
^Z

C:\Program Files\juman>juman -b < input.txt
脳          (のう)       脳         普通名詞
動脈        (どうみゃく) 動脈       普通名詞
の          (の)         の         接続助詞
閉塞        (へいそく)   閉塞       サ変名詞
EOS

C:\Program Files\juman>juman -b < input.txt > output.txt

C:\Program Files\juman>type output.txt
脳          (のう)       脳         普通名詞
動脈        (どうみゃく) 動脈       普通名詞
の          (の)         の         接続助詞
閉塞        (へいそく)   閉塞       サ変名詞
EOS

C:\Program Files\juman>
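Since the question was about tokenizing, the output above can be turned into a token list with a few lines of Python. This is only a sketch based on the four-column layout shown in this post; the column names (surface, reading, base, pos) are my own labels for what the columns appear to hold, and parse_juman is a made-up helper, not part of Juman.

```python
def parse_juman(output):
    """Split `juman -b` output, as shown above, into one dict per token."""
    tokens = []
    for line in output.splitlines():
        fields = line.split()
        if not fields or fields[0] == "EOS":
            continue  # skip blank lines and the end-of-sentence marker
        if len(fields) != 4:
            continue  # not a token line in the layout assumed here
        surface, reading, base, pos = fields
        tokens.append({
            "surface": surface,
            "reading": reading.strip("()"),  # drop the surrounding brackets
            "base": base,
            "pos": pos,
        })
    return tokens


# Sample copied from the output in this post.
sample = """脳          (のう)       脳         普通名詞
動脈        (どうみゃく) 動脈       普通名詞
の          (の)         の         接続助詞
閉塞        (へいそく)   閉塞       サ変名詞
EOS
"""

for token in parse_juman(sample):
    print(token["surface"], token["reading"])
```

The text in output.txt would first have to be decoded from Shift-JIS, for example with open("output.txt", encoding="shift_jis").read(), before it is passed to parse_juman.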
All times are GMT + 8 Hours
Page 1 of 2

 