0.0.4
- Tokenization: Converts input text into a list of token IDs so that text can be processed and manipulated at the token level. Default model: `all-MiniLM-L6-v2`.
- Detokenization: Reconstructs text from a list of token IDs, reversing the tokenization process. Default model: `all-MiniLM-L6-v2`.
0.0.3
- Adaptive Throttling: Implemented an adaptive throttling mechanism that delays requests using the `Retry-After` header when errors are encountered due to high request frequency or processing failures. The delay is dynamically adjusted based on the client’s request rate and error occurrences.
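
For callers who want to honor the `Retry-After` header described above, the following is a minimal client-side sketch, not part of this package's API. The names `parse_retry_after` and `call_with_throttle`, the `send` callable returning a `(status, headers)` pair, and the choice of 429 and 503 as the throttled status codes are all assumptions made for illustration. The header parsing follows the standard HTTP definition, where the value is either a number of seconds or an HTTP-date.

```python
import time
from datetime import datetime, timezone
from email.utils import parsedate_to_datetime


def parse_retry_after(value, now=None):
    """Return the delay in seconds encoded by a Retry-After header value.

    The value is either a non-negative integer (seconds) or an HTTP-date.
    Missing or malformed values yield 0.0.
    """
    if value is None:
        return 0.0
    value = value.strip()
    if value.isascii() and value.isdigit():
        return float(value)
    try:
        when = parsedate_to_datetime(value)
    except (TypeError, ValueError):
        return 0.0
    if when.tzinfo is None:
        # Treat a date without zone information as UTC.
        when = when.replace(tzinfo=timezone.utc)
    now = now or datetime.now(timezone.utc)
    return max(0.0, (when - now).total_seconds())


def call_with_throttle(send, max_attempts=3, sleep=time.sleep):
    """Call send() until the response is not throttled or attempts run out.

    `send` is a hypothetical callable returning (status, headers).
    Treating 429 and 503 as "throttled" is an assumption of this sketch.
    """
    for attempt in range(max_attempts):
        status, headers = send()
        if status not in (429, 503):
            break
        if attempt < max_attempts - 1:
            # Wait for as long as the server asked before retrying.
            sleep(parse_retry_after(headers.get("Retry-After")))
    return status, headers
```

Passing `sleep` as a parameter keeps the retry loop testable without real waiting; in normal use the default `time.sleep` applies.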