
Introducing Phi-3 WebGPU

Introducing Phi-3 WebGPU, a private and powerful AI chatbot that runs locally in your browser, powered by 🤗 Transformers.js and onnxruntime-web!

🔒 On-device inference: no data sent to a server
⚡️ WebGPU-accelerated (> 20 t/s)
📥 Model downloaded once and cached
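
WebGPU acceleration depends on the browser exposing `navigator.gpu` and returning an adapter. The sketch below is one way a page might check for that before loading a model. It is not taken from the demo's source: `pickBackend`, the `NavigatorLike` shape, and the `"wasm"` fallback label are hypothetical names used for illustration only.

```typescript
// Minimal shape of the part of `navigator` this sketch inspects.
// (Declared locally so the example compiles without WebGPU type packages.)
interface GpuLike {
  requestAdapter(): Promise<unknown | null>;
}
interface NavigatorLike {
  gpu?: GpuLike;
}

// Hypothetical helper: resolves to "webgpu" when the browser exposes
// WebGPU and hands back an adapter, otherwise to "wasm" (an assumed
// label for a non-GPU fallback).
async function pickBackend(nav: NavigatorLike): Promise<"webgpu" | "wasm"> {
  if (!nav.gpu) return "wasm"; // WebGPU API not exposed at all
  try {
    const adapter = await nav.gpu.requestAdapter();
    return adapter ? "webgpu" : "wasm"; // null means no suitable GPU
  } catch {
    return "wasm"; // treat any adapter error as "not available"
  }
}

// In a browser this could be called as:
//   const backend = await pickBackend(navigator as NavigatorLike);
```

Passing the navigator object in as an argument, rather than reading the global directly, keeps the check easy to exercise outside a browser.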

Phi-3 running at 42 tokens per second 100% locally in your browser! 🤯⚡️

What speed do you get?
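
If you want to compare numbers, a tokens-per-second figure is the count of generated tokens divided by the elapsed wall-clock time. Here is a minimal sketch of that arithmetic. `tokensPerSecond` and `measure` are hypothetical helpers, not part of the demo or of Transformers.js, and the `generate` callback is assumed to resolve to the number of tokens it produced.

```typescript
// Hypothetical helper: throughput in tokens per second.
function tokensPerSecond(tokenCount: number, elapsedMs: number): number {
  if (elapsedMs <= 0) throw new RangeError("elapsedMs must be positive");
  return tokenCount / (elapsedMs / 1000);
}

// Hypothetical helper: times an async generation callback.
// `generate` is assumed to resolve to the number of tokens produced.
async function measure(generate: () => Promise<number>): Promise<number> {
  const start = performance.now();
  const tokens = await generate();
  return tokensPerSecond(tokens, performance.now() - start);
}

// Example: 420 tokens generated in 10 seconds.
const rate = tokensPerSecond(420, 10_000); // 42
```

The measured rate depends on what the timer includes: prompt processing and model download time will lower it compared with a figure taken over generation alone.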

Try it out! 👇 (from Xenova)

https://huggingface.co/spaces/Xenova/experimental-phi3-webgpu

This post is licensed under CC BY 4.0 by the author.