Provide a simple table of time, tokens used, and success (Pass@10) rate for different models (and hardware for time).