Package com.reign.kat.lib.voice.receive
Class VoiceRecognition
java.lang.Object
com.reign.kat.lib.voice.receive.VoiceRecognition
- All Implemented Interfaces:
IAudioRecvListener
-
Field Summary
Fields -
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionstatic void
init()
static VoiceRecognition
instance()
static boolean
void
onUserFinishedSpeaking
(net.dv8tion.jda.api.entities.Member member, AudioUser data) static byte[]
transcode
(byte[] origData) Converts audio data from Discord's format (48Khz, 16-Bit Big-endian Stereo) to a format that VOSK needs (16Khz 16-Bit Little-endian Mono).static String
wakeWordUttered
(String speech)
-
Field Details
-
model
public static org.vosk.Model model
-
-
Constructor Details
-
VoiceRecognition
public VoiceRecognition()
-
-
Method Details
-
instance
-
init
public static void init() -
isModelReady
public static boolean isModelReady() -
onUserFinishedSpeaking
- Specified by:
onUserFinishedSpeaking
in interfaceIAudioRecvListener
-
wakeWordUttered
-
transcode
public static byte[] transcode(byte[] origData) Converts audio data from Discord's format (48Khz, 16-Bit Big-endian Stereo) to a format that VOSK needs (16Khz 16-Bit Little-endian Mono).- Parameters:
origData
- audio PCM data to convert- Returns:
- 16Khz mono audio
-