I am loading the CMU 64,000-word 3-gram language model for English. It works fine but,
of course, loads slowly in its ARPA text format. The file converts successfully with "mkbingram -nlr cmu.lm cmu.bin",
but loading the resulting binary version causes a segmentation fault:
#0 0xb7e757e1 in free () from /lib/i386-linux-gnu/i686/cmov/libc.so.6
#1 0x08099ce1 in ngram_info_free (ndata=0x871e970)
    at src/ngram/ngram_malloc.c:84
#2 0x08063395 in initialize_ngram (lmconf=0x82c5718, winfo=0x86d9300)
    at src/m_fusion.c:459
#3 0x08063697 in j_load_lm (recog=0x82c5c78, lmconf=0x82c5718)
    at src/m_fusion.c:641
#4 0x080637d1 in j_load_all (recog=0x82c5c78, jconf=0x82c53f0)
    at src/m_fusion.c:715
#5 0x0804a183 in main (argc=3, argv=0xbffff674) at main.c:140
(gdb) p ndata
$1 = (NGRAM_INFO *) 0x871e970
(gdb) p *ndata
$2 = {n = 0, dir = 0, from_bin = 0 '\000', bigram_index_reversed = 0 '\000',
(gdb)