<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: ycui7</title><link>https://news.ycombinator.com/user?id=ycui7</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Wed, 29 Apr 2026 18:45:21 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=ycui7" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by ycui7 in "Intel Arc Pro B70 Review"]]></title><description><![CDATA[
<p>Exiting dGPU for gaming, but staying in the LLM world.</p>
]]></description><pubDate>Wed, 29 Apr 2026 07:45:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=47945282</link><dc:creator>ycui7</dc:creator><comments>https://news.ycombinator.com/item?id=47945282</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47945282</guid></item><item><title><![CDATA[New comment by ycui7 in "Intel Arc Pro B70 Review"]]></title><description><![CDATA[
<p>B70 idles at 30W, while RTX PRO 4500 idles at 9W (measured to be 5W at wall).<p>B70 runs at 1/3 token output rate of RTX PRO 4500 and consume 3X idle power when do nothing.</p>
]]></description><pubDate>Wed, 29 Apr 2026 07:42:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=47945265</link><dc:creator>ycui7</dc:creator><comments>https://news.ycombinator.com/item?id=47945265</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47945265</guid></item><item><title><![CDATA[New comment by ycui7 in "Intel Arc Pro B70 Review"]]></title><description><![CDATA[
<p>When you get 4 of these, the idle power alone is 120W. That is a lot of electricity if left on 24/7.<p>At that power consumption, you also end up being more expensive than API calls and many times slower. It starts to feel very stupid to run local interference.<p>If the client is very keen on privacy, then they can pay for the NVIDIA.<p>I end up returning my B70s, and bought RTX PRO 6000.</p>
]]></description><pubDate>Wed, 29 Apr 2026 07:37:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=47945227</link><dc:creator>ycui7</dc:creator><comments>https://news.ycombinator.com/item?id=47945227</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47945227</guid></item><item><title><![CDATA[New comment by ycui7 in "Intel Arc Pro B70 Review"]]></title><description><![CDATA[
<p>At this speed, people end up paying more on electricity than api calls. (California electricity)</p>
]]></description><pubDate>Wed, 29 Apr 2026 07:31:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=47945197</link><dc:creator>ycui7</dc:creator><comments>https://news.ycombinator.com/item?id=47945197</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47945197</guid></item><item><title><![CDATA[New comment by ycui7 in "Intel Arc Pro B70 Review"]]></title><description><![CDATA[
<p>You can get 120TPS (144 peak) with Qwen3.6-27B on RTX PRO 6000 with autoround when MTP enabled. It runs faster than sonnet api calls.<p>5090 gets maybe 100TPS with MTP</p>
]]></description><pubDate>Wed, 29 Apr 2026 07:28:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=47945175</link><dc:creator>ycui7</dc:creator><comments>https://news.ycombinator.com/item?id=47945175</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47945175</guid></item><item><title><![CDATA[New comment by ycui7 in "Intel Arc Pro B70 Review"]]></title><description><![CDATA[
<p>Problem is the more B70 you have, the slower the inference it gets(due to terrible software atm). A single B70 is almost barely faster than CPU inference. If you have 4 B70, you might as well run interference on CPU and be faster with cheaper DDR5 instead of GDDR6.</p>
]]></description><pubDate>Wed, 29 Apr 2026 07:25:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=47945155</link><dc:creator>ycui7</dc:creator><comments>https://news.ycombinator.com/item?id=47945155</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47945155</guid></item><item><title><![CDATA[New comment by ycui7 in "Intel Arc Pro B70 Review"]]></title><description><![CDATA[
<p>Intel Arc B70 when released, can only produce 1/3 of the token of RTX PRO 4500. Well, it also cost 1/3 of RTX PRO 4500.<p>It lacked software support the for the primary target application, running LLM. The officially supported vllm fork is 6 version behind mainline. It did not run the latest hot new open models on huggingface. Parallel two of B70 reduce token rate, not improve it. So, the software behind B70 is basically so far behind.</p>
]]></description><pubDate>Wed, 29 Apr 2026 03:32:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=47943900</link><dc:creator>ycui7</dc:creator><comments>https://news.ycombinator.com/item?id=47943900</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47943900</guid></item><item><title><![CDATA[New comment by ycui7 in "Intel Arc Pro B70 Review"]]></title><description><![CDATA[
<p>It is weird that the reviewer does not mention RTX PRO 6000 96GB, but mentioned RTX PRO 5000 72GB. 72GB RTX PRO 5000 is a special order, and much less people are aware of it. RTX PRO 6000 is known by mostly everyone in the LLM world.<p>I cannot understand why would a tech reviewer do that.</p>
]]></description><pubDate>Wed, 29 Apr 2026 03:28:40 +0000</pubDate><link>https://news.ycombinator.com/item?id=47943877</link><dc:creator>ycui7</dc:creator><comments>https://news.ycombinator.com/item?id=47943877</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47943877</guid></item></channel></rss>