Skip to content
Merged
Show file tree
Hide file tree
Changes from 6 commits
Commits
Show all changes
58 commits
Select commit Hold shift + click to select a range
317a178
adding ja
BuyuanCui Feb 21, 2024
31dae98
adding ja
BuyuanCui Feb 21, 2024
43a94d5
adding ja
BuyuanCui Feb 21, 2024
2bab707
adding ja gramamrs
BuyuanCui Feb 21, 2024
82555e9
adding ja tests
BuyuanCui Feb 21, 2024
e6111f6
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Feb 21, 2024
cb24883
format fixs
BuyuanCui Feb 21, 2024
3931987
Merge branch 'jp_itn_20240221' of https://github.com/NVIDIA/NeMo-text…
BuyuanCui Feb 21, 2024
e1e10a6
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Feb 21, 2024
eed0c27
removing commented out tests
BuyuanCui Feb 22, 2024
d80789b
Merge branch 'jp_itn_20240221' of https://github.com/NVIDIA/NeMo-text…
BuyuanCui Feb 22, 2024
6e26901
updated for japanese
BuyuanCui Feb 23, 2024
b610752
added japanese for tests
BuyuanCui Feb 23, 2024
e3bba6f
Update Dockerfile
BuyuanCui Apr 17, 2024
21e410f
Update export_grammars.sh
BuyuanCui Apr 17, 2024
b53e864
Update pynini_export.py
BuyuanCui Apr 17, 2024
ce6b44b
Update sh_test.sh
BuyuanCui Apr 17, 2024
d31d854
Update launch.sh
BuyuanCui Apr 17, 2024
5914183
Update test_sparrowhawk_inverse_text_normalization.sh
BuyuanCui Apr 17, 2024
c6e7a15
restyle
BuyuanCui Apr 19, 2024
c5e7c87
updates
BuyuanCui May 1, 2024
2478a93
Merge branch 'jp_itn_20240221' of https://github.com/NVIDIA/NeMo-text…
BuyuanCui May 7, 2024
14d2dee
updates
BuyuanCui May 8, 2024
21ffb37
configuration issue
BuyuanCui May 8, 2024
61ffad0
copied from main
BuyuanCui May 8, 2024
37bc761
copied from main
BuyuanCui May 8, 2024
abba1a8
copied from main;
BuyuanCui May 8, 2024
203865f
copied from main
BuyuanCui May 8, 2024
3f43d2c
typo
BuyuanCui May 15, 2024
1320564
typo
BuyuanCui May 15, 2024
b227b07
copied from main
BuyuanCui May 15, 2024
9b90d24
space issue
BuyuanCui May 22, 2024
c8642b9
space issue
BuyuanCui May 22, 2024
dc7f74a
space issue
BuyuanCui May 22, 2024
c06b3bf
space issue
BuyuanCui May 22, 2024
51bb8f6
space issue
BuyuanCui May 22, 2024
9b38fbd
space issue
BuyuanCui May 22, 2024
6915cfd
space issues
BuyuanCui May 22, 2024
32348b3
updated fraction tagger and test cases
BuyuanCui Jul 9, 2024
79a0e97
changed data format for corrections
BuyuanCui Jul 9, 2024
2d8e3cd
added back all 6 tests
BuyuanCui Jul 9, 2024
2048a60
added ja on line 168
BuyuanCui Jul 9, 2024
8d7ab86
updated the grammar to resolve empty sparrohawt test result
BuyuanCui Jul 9, 2024
3ed3bd1
updated for empty SH test result
BuyuanCui Jul 9, 2024
d654198
updates for japanese on dates and tests
BuyuanCui Jul 9, 2024
97b1bd9
Merge branch 'main' into jp_itn_20240221
BuyuanCui Jul 9, 2024
10bb48c
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Jul 9, 2024
4334750
added ITNPostProcessingFst as None
BuyuanCui Jul 10, 2024
d63d7b3
Update pynini_export.py added ITNPOstProcessorFst as None
BuyuanCui Jul 10, 2024
2df505e
Update ordinal.py remioviung unused imports
BuyuanCui Jul 10, 2024
3df7d0b
Update ordinal.py removing unnecessary comments
BuyuanCui Jul 10, 2024
15274ef
Merge branch 'jp_itn_20240221' of https://github.com/NVIDIA/NeMo-text…
BuyuanCui Jul 15, 2024
f57a8ac
removing unsed import
BuyuanCui Jul 15, 2024
e16954c
update for japanese itn
BuyuanCui Jul 15, 2024
21604ab
fixing typo
BuyuanCui Jul 16, 2024
5053f90
updates on era abbreviations
BuyuanCui Jul 16, 2024
f6b2d44
updates on era abbreviations
BuyuanCui Jul 16, 2024
17a38af
updates on the normalization comments
BuyuanCui Jul 16, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Original file line number Diff line number Diff line change
Expand Up @@ -116,6 +116,11 @@ def __init__(
from nemo_text_processing.inverse_text_normalization.hy.verbalizers.verbalize_final import (
VerbalizeFinalFst,
)
elif lang == 'ja': # Japanese
from nemo_text_processing.inverse_text_normalization.ja.taggers.tokenize_and_classify import ClassifyFst
from nemo_text_processing.inverse_text_normalization.ja.verbalizers.verbalize_final import (
VerbalizeFinalFst,
)

self.tagger = ClassifyFst(
cache_dir=cache_dir, whitelist=whitelist, overwrite_cache=overwrite_cache, input_case=input_case
Expand Down
17 changes: 17 additions & 0 deletions nemo_text_processing/inverse_text_normalization/ja/__init__.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,17 @@
# Copyright (c) 2023, NVIDIA CORPORATION. All rights reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.

from nemo_text_processing.inverse_text_normalization.ja.taggers.tokenize_and_classify import ClassifyFst
from nemo_text_processing.inverse_text_normalization.ja.verbalizers.verbalize import VerbalizeFst
from nemo_text_processing.inverse_text_normalization.ja.verbalizers.verbalize_final import VerbalizeFinalFst
Original file line number Diff line number Diff line change
@@ -0,0 +1,17 @@
# Copyright (c) 2023, NVIDIA CORPORATION. All rights reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.

from nemo_text_processing.inverse_text_normalization.ja.taggers.tokenize_and_classify import ClassifyFst
from nemo_text_processing.inverse_text_normalization.ja.verbalizers.verbalize import VerbalizeFst
from nemo_text_processing.inverse_text_normalization.ja.verbalizers.verbalize_final import VerbalizeFinalFst
Original file line number Diff line number Diff line change
@@ -0,0 +1,13 @@
# Copyright (c) 2022, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
Original file line number Diff line number Diff line change
@@ -0,0 +1,6 @@
Loading