Liquid AI’s LFM 2.5 runs a vision-language model locally in your browser via WebGPU and ONNX Runtime, working offline once ...
Abstract: We present GLEE in this work, an object-level foundation model for locating and identifying objects in images and videos. Through a unified framework, GLEE accomplishes detection, ...