How to quantize to gguf using llama.cpp correctly

@asirgogogo I tried `convert_hf_to_gguf.py` but get errror "ERROR:hf-to-gguf:Model IndexForCausalLM is not supported".
The old `examples/convert_legacy_llama.py` can convert to gguf. but this gguf output meaningless repeated characters only.