JikkyoSubtitle: Real-time transcription and translation of system audio into Chinese.
- Hide Border
- Always on Top
- Translation/Original Only
- Scrolling/Fixed Subtitles
- Download: Obtain the pre-built binary from the latest release in the Releases section.
- Configure: Modify the config.json file. Detailed instructions for each setting are provided in configs/README.md.
- Launch: Run realtime-bilingual-asr.exe.
- Settings: After launching, you can adjust settings by right-clicking the system tray icon.
Note:
- Ensure that the directory containing realtime-bilingual-asr.exe is writable; the application needs write access to it to function correctly.
- Please refer to configs/README.md for the first-time setup.
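The write-access requirement in the note above can be checked before launching. The following is a minimal sketch in Python and is not part of JikkyoSubtitle; the helper name is_dir_writable and the example install path are hypothetical. It tries to create a temporary file in the directory, which tends to be a more reliable test on Windows than inspecting permission flags.

```python
import tempfile
from pathlib import Path


def is_dir_writable(directory: str) -> bool:
    """Return True if a temporary file can be created in `directory`."""
    path = Path(directory)
    if not path.is_dir():
        return False
    try:
        # Creating (and automatically deleting) a real file exercises the
        # same kind of access the application needs.
        with tempfile.TemporaryFile(dir=str(path)):
            pass
    except OSError:
        return False
    return True


if __name__ == "__main__":
    # Hypothetical location; substitute the folder that holds
    # realtime-bilingual-asr.exe.
    install_dir = r"C:\Tools\JikkyoSubtitle"
    print("writable" if is_dir_writable(install_dir) else "not writable")
```

If the check reports that the folder is not writable, moving the application out of a protected location such as Program Files is usually the simplest remedy.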
Prerequisites:
- Visual Studio with the "Desktop development with C++" workload: Ensure it includes the MSVC compiler and CMake.
- vcpkg: Installed and integrated with your system (you've likely already done this if you have a vcpkg.json). Make sure the VCPKG_ROOT environment variable is set and vcpkg integrate install has been run.
- Git: For fetching the project (if applicable).
- (Optional) VC-LTL: Just follow the steps for "Using VC-LTL in CMake." No changes to the CMake and vcpkg commands are needed.
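Before configuring, it can help to confirm that VCPKG_ROOT actually points at a usable vcpkg checkout. The sketch below is an illustration in Python, not a tool shipped with this project; the function name find_vcpkg_toolchain is hypothetical. It only checks for the toolchain file that the configure step passes to CMake and does not verify that vcpkg integrate install has been run.

```python
import os
from pathlib import Path
from typing import Mapping, Optional


def find_vcpkg_toolchain(env: Mapping[str, str]) -> Optional[Path]:
    """Return the vcpkg CMake toolchain file path, or None if unavailable."""
    root = env.get("VCPKG_ROOT")
    if not root:
        return None
    # Same relative path that the configure step passes to CMake.
    toolchain = Path(root) / "scripts" / "buildsystems" / "vcpkg.cmake"
    return toolchain if toolchain.is_file() else None


if __name__ == "__main__":
    toolchain = find_vcpkg_toolchain(os.environ)
    if toolchain is None:
        print("VCPKG_ROOT is unset or does not contain the vcpkg toolchain file")
    else:
        print(f"vcpkg toolchain found: {toolchain}")
```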
Steps:
- Clone (if needed):
    git clone https://github.com/Sg4Dylan/JikkyoSubtitle.git
    cd JikkyoSubtitle
- Install Dependencies (using vcpkg manifest mode):
    vcpkg install
- Configure:
    cmake -B build -S . -DCMAKE_TOOLCHAIN_FILE=%VCPKG_ROOT%/scripts/buildsystems/vcpkg.cmake
    -B build: Creates a build directory for an out-of-source build.
    -S .: Specifies the source directory (the current directory).
    -DCMAKE_TOOLCHAIN_FILE: Points CMake to the vcpkg toolchain file.
- Build:
    cmake --build build --config Release
    --config Release: Builds the Release configuration. Change to Debug for debugging.
- Run: Your executable should be in the build/Release or build/Debug folder.
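The install, configure, and build steps above can be strung together in a small script. This is a sketch in Python under stated assumptions, not a build script provided by the repository: the names build_commands and run_all are hypothetical, and the script assumes it is run from the project root with vcpkg and cmake on PATH. The command arguments themselves are the ones listed in the steps.

```python
import subprocess
from typing import List


def build_commands(vcpkg_root: str, config: str = "Release") -> List[List[str]]:
    """Return the dependency, configure, and build commands in order."""
    toolchain = f"{vcpkg_root}/scripts/buildsystems/vcpkg.cmake"
    return [
        ["vcpkg", "install"],
        ["cmake", "-B", "build", "-S", ".", f"-DCMAKE_TOOLCHAIN_FILE={toolchain}"],
        ["cmake", "--build", "build", "--config", config],
    ]


def run_all(vcpkg_root: str, config: str = "Release") -> None:
    """Run each command in the current directory, stopping at the first failure."""
    for command in build_commands(vcpkg_root, config):
        # check=True raises CalledProcessError if a step exits with an error.
        subprocess.run(command, check=True)
```

Calling run_all(os.environ["VCPKG_ROOT"]) after importing os would run a Release build; passing "Debug" as the second argument selects the Debug configuration instead.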
This project is licensed under the AGPL. It incorporates the following open-source projects:
- Windows-universal-samples (by microsoft): MIT license. Used to capture audio.
- sherpa-onnx (by k2-fsa): Apache-2.0 license. Used for voice activity detection & speech recognition.
- libwtfdanmaku (by copyliu): LGPL 2.1 license. Used for rendering on-screen subtitles.
- IXWebSocket (by machinezone): BSD-3 Clause license. Used for HTTP/WebSocket implementation.
- JSON for Modern C++ (by nlohmann): MIT license. Used for JSON serialization/deserialization.
- VC-LTL5 (by Chuyu-Team): EPL-2.0 license. Used to enhance program compatibility.
Please refer to the respective project repositories for full license details.