70B LLM, GPU. 5 t/s, with fast 38 t/s GPU prompt processing.
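
To put those two rates in perspective, here is a rough sketch of how they combine into a wait time for one request. It assumes the 5 t/s figure is the generation (output) speed and that prompt processing and generation run one after the other. The function name and its parameters are made up for illustration and are not from any particular tool.

```python
def estimate_latency_seconds(prompt_tokens, output_tokens,
                             prompt_tps=38.0, gen_tps=5.0):
    """Rough end-to-end time for one request, in seconds.

    prompt_tps: prompt processing speed in tokens/s (38 t/s quoted above).
    gen_tps:    generation speed in tokens/s (ASSUMED to be the 5 t/s figure).
    """
    if prompt_tps <= 0 or gen_tps <= 0:
        raise ValueError("token rates must be positive")
    if prompt_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")

    # Time to ingest the prompt, then time to emit the reply.
    prompt_time = prompt_tokens / prompt_tps
    generation_time = output_tokens / gen_tps
    return prompt_time + generation_time
```

For example, `estimate_latency_seconds(380, 50)` comes out to about 20 seconds: roughly 10 s to process a 380-token prompt and 10 s to generate 50 tokens. Real numbers will vary with context length, quantization, and how many layers fit on the GPU.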