OSDN > 브라 우즈 소프트웨어 > Multimedia > Sound/Audio > Speech > Julius > 다운로드 파일 목록 > 포장된 julius > Release 4.2.3

Julius

RSS Back to release list
Download List

프로젝트 설명

Julius is an open-source, high-performance large vocabulary continuous speech recognition (LVCSR) engine for speech-related researchs and developments. With HMM acoustic model and language model, you can construct your own speech recognition system.

Moved to github: https://github.com/julius-speech/julius

System Requirements

System requirement is not defined

Review

Average

4.5

32 Reviews

5 Stars	24
4 Stars	3
3 Stars	3
2 Stars	0
1 Star	2

Your rating

Review this project

dictation-kit

4.5

4.4.2.1a

4.4.2.1

4.4

4.3.1

4.2.3

4.2

4.1

4.0

3.2

3.1

3.0

2.2

grammar-kit

4.2

4.1

3.1

3.0

julius

4.4.2

4.4.1

4.4

4.3.1

4.3

4.2.3

4.2.2

4.2.1

4.2

4.1.5.1

4.1.5

4.1.4

4.1.3

4.1.2

4.1.1

4.1

4.0.2

4.0.1

4.0

3.5.3

3.5.2

3.5.1

3.5

3.4.2

3.4.1

3.4

3.3p4

3.3p3

free-3.3p2 multipath

free-3.3p2

3.1p2

Juliusbook

4.1.5

4.1.3

4.1.2

4.1-ja

sample_grammars

20050817

segmentation-kit

4.0

1.0-julius4

1.0

simpleword-kit

2.0

slf2dfa

1.0

wordflow

0.5

Released at 2013-06-30 00:00
julius 4.2.3 (3 files Hide)

Version 4.2.3 includes new features: dictionary reloading, audio input
scaling, long input rejection, minimum bayes risk decoding, and
character set conversion of binary N-gram. Several bugs has also been
fixed.

New options:
-lvscale
-rejectlong
-mbr, -nombr, -mbr_wwer, -mbr_weight

See the "Release.txt" file for the full list of updates.
Run with "-help" to see full list of options.

バージョン 4.2.3 で追加された新機能は、辞書の再読み込み、入力音声波形の
スケーリング、長時間入力の棄却、ベイズリスク最小化探索（南條浩輝氏，古
谷遼氏より提供）、およびバイナリN-gramの文字コード変換機能です。またい
くつかの不具合が修正されました。

新オプション：
-lvscale
-rejectlong
-mbr, -nombr, -mbr_wwer, -mbr_weight

変更点の詳細な一覧は Release-ja.txt をご覧ください．
また "-help" をつけて実行することでオプションの全リストが出力されます。

Changelog

English
日本語

New features:
- Add function "j_reload_adddict()" to reload dictionaries.
- Add option "-lvscale factor" and func "j_adin_change_input_scaling_factor()"
to scale the amplitude of captured audio by the factor.
- Add option "-rejectlong msec" to reject too long input.
- Add minimum bayes risk decoding, contributed by H. Nanjo and R. Furutani
- Support binary N-gram symbol charset conversion by "mkbingram".

Fixes:
- Fix sending audio stream via network with incorrect byte order at
big-endian machines.
- Fix occasional failure of closing audio device at j_close_stream().
- Fix segfault when reading binary hmm created at 64bit env. with embedded parameters.
- Fix memory leak when failed to read an N-gram file.
- Fix memory leak when input length overflow is detected.
- Fix unable to load feature vector plugin.
- Update microphone input code for recent MacOSX.

○新機能
- 辞書のリロード: j_reload_adddict()
- 音声入力の振幅のスケーリング："-lvscale factor" および
"j_adin_change_input_scaling_factor()"
- 長い入力の棄却："-rejectlong msec"
- バイナリN-gramの文字コード変換をサポート: "mkbingram"
- ベイズリスク最小化(MBR)探索の実装（南條浩輝氏，古谷遼氏より提供）
MBR Extension contributed by Hiroaki Nanjo and Ryo Furutani.

○修正されたバグ
- ビッグエンディアンマシンでadinnetで音声をネットワーク送信するときに
エンディアンが異なっていた問題
- j_close_stream() で音声デバイスのクローズにたまに失敗する
- 64bit環境で変換されたパラメータ埋め込み付きバイナリHMMが読み込めない問題
- N-gram ファイル読み込み失敗時のメモリリーク
- 入力長オーバーフロー時のメモリリーク
- 特徴ベクトル呼び込みプラグインが動作しない
- 最新のMacOSXでコンパイルに失敗する