v1.4.1 - cuda/vram/multiprocessing/threading
Properly implemented multithreading/processing to make sure the CUDA/VRAM usage (and the GUI in general) doesn't freeze when creating the vector database nor when querying the database.
Updated pro tip to reflect reliable comments on Discord regarding larger LLMs being helpful for especially technical jargon.