请问谁知道如何让ollama支持npu

主题：请问谁知道如何让ollama支持npu

正序浏览

主题图片汇总

主题内检索

虚化

禁言中

泡网分: 32.938

主题: 133

帖子: 7506

2025-02-10 20:52

手头一台ultra185h笔记本和一台mac mini m4都有npu，这两天在玩ollama，发现cpu gpu都用上了，专门给AI用的npu却没用上，太讽刺了
谁知道如何让ollama支持npu？

相关标签： Mini

(0)

只看该作者

不看该作者

加入黑名单

返回顶部

个人图文集

个人作品集

福建省

第 1 帖

浏览：5451 回帖：6

第 1 页 / 共 1 页

游客没有回帖权限

一晴方觉夏深

泡菜

泡网分: 1.971

主题: 92

帖子: 1527

2025-02-22 15:07

期待ollama能够支持M芯片的NPU

(0)

只看该作者

不看该作者

加入黑名单

返回顶部

个人图文集

个人作品集

重庆

第 7 帖

闲聊状态

泡菜

泡网分: 18.34

主题: 11

帖子: 15211

2025-02-22 12:38

INTEL的CPU里的NPU 可以看看 github上的 ipex 这个项目，INTEL 专属，其中就有NPU的支持

(0)

只看该作者

不看该作者

加入黑名单

返回顶部

个人图文集

个人作品集

山东省

第 6 帖

虚化

禁言中

泡网分: 32.938

主题: 133

帖子: 7506

2025-02-21 22:49

微软提供了Copilot+ PC的npu支持，回头在我的ultra 185h上试试看

https://learn_microsoft_com/en-us/windows/ai/toolkit/toolkit-getting-started?utm_source=chatgpt_com&tabs=rest

Running Distilled DeepSeek R1 models locally on Copilot+ PCs, powered by Windows Copilot Runtime

The Neural Processing Unit (NPU) on Copilot+ PCs offers a highly efficient engine for model inferencing,
unlocking a paradigm where generative AI can execute not just when
invoked, but enable semi-continuously running services. This empowers
developers to tap into powerful reasoning engines to build proactive and
sustained experiences.
With our work on Phi Silica, we were able to harness highly efficient inferencing – delivering very
competitive time to first token and throughput rates, while minimally
impacting battery life and consumption of PC resources. Running models
on the NPU is about speed and efficiency. For example, as mentioned in
previous posts, the Phi Silica token iterator on the NPU exhibits a 56%
improvement in power consumption compared to operating on the CPU. Such
efficiency enables new experiences that demand such state-of-the art
models to be in the main loop of the program without draining your
battery or overly heating your device. The optimized DeepSeek models for
the NPU take advantage of several of the key learnings and techniques
from that effort, including how we separate out the various parts of the
model to drive the best tradeoffs between performance and efficiency,
low bit rate quantization and mapping transformers to the NPU.
Additionally, we take advantage of Windows Copilot Runtime (WCR) to
scale across the diverse Windows ecosystem with ONNX QDQ format.

(0)

只看该作者

不看该作者

加入黑名单

返回顶部

个人图文集

个人作品集

福建省

第 5 帖

CATWK

泡菜

泡网分: 25.886

主题: 495

帖子: 15624

2025-02-11 09:08

npu是不用cuda的吧？可能需要driver
CATWK 编辑于 2025-02-11 09:09

(0)

只看该作者

不看该作者

加入黑名单

返回顶部

个人图文集

个人作品集

北京

第 4 帖

qingcai

资深泡菜

泡网分: 29.483

主题: 8

帖子: 4097

2025-02-10 23:01

感觉搞一个 mac pro 96gb ram的最给力了。

(0)

只看该作者

不看该作者

加入黑名单

返回顶部

个人图文集

个人作品集

美国

第 3 帖

notebook1

陈年泡菜

泡网分: 26.718

主题: 135

帖子: 4534

年内主力装备

★ LEICA Q3

★ Canon EOS R5

2025-02-10 22:58

ollama 是用gpu纯计算的。
其实 mac的统一内存合适，一般弄张几十G显存的显卡很贵
而Mac 内存和显存是统一的

(0)

只看该作者

不看该作者

加入黑名单

返回顶部

个人图文集

个人作品集

北京

第 2 帖

< 返回版块列表

第 1 页 / 共 1 页

回复主题：请问谁知道如何让ollama支持npu

功能受限

游客无回复权限，请登录后再试

回复主题： 请问谁知道如何让ollama支持npu

回复主题：请问谁知道如何让ollama支持npu