日本电子维修技术 装机/软件US Department of Energy 未来要和 IBM 和 nvi




Summit和Sierra现有的CUDA代码将被NTR到AMD ROCm上



ESSRNV4VAAAdsNs.jpg (204.04 KB, 下载次数: 0)

2020-3-5 12:38 上传





捕获.JPG (107.69 KB, 下载次数: 0)

2020-3-5 12:26 上传










https://www.nextplatform.com/2020/03/04/lawrence-livermore-to-surpass-2-exaflops-with-amd-compute/

IBM did not get any piece of the CORAL-2 contract and neither did Nvidia, and it is highly unlikely that a future Argonne machine that could happen some years hence will be based on IBM Power10 or Power11 CPUs and future Nvidia GPUs. It is much more likely that it will be an all-AMD machine like Frontier and El Capitan. And while no company is dependent on supercomputer contracts like the CORAL-2 deal to sustain their businesses, such deals help pay for research and development for future products that can be commercialized for other customers – and sold at much, much higher margins.

Back in August, when some of the details of the El Capitan machine were divulged by Lawrence Livermore, it seemed a bit coy not to talk about what CPUs and GPUs were going to be used in the system. But that was not the intent. There was actually some game theory going on here, which is what you would expect from an organization that does world-class simulations.

“Lawrence Livermore uses best value procurements, and our decision was based on evaluating the options that were available in the timeframe that we needed,” explained Bronis de Supinski, chief technical officer at Livermore Computing, the division of the lab that architects and runs its supercomputers, during a conference call announcing the awarding of the compute engines to AMD. “There were others, and based on the performance that we expect the AMD processors to deliver to our actual workload, our decision was that they would provide by far the best value to the government.”




AMD, Cray, and Lawrence Livermore did not give any more specifics about the El Capitan architecture, except to say that it would be using a single-socket server Epyc linked coherently to four Radeon Instinct GPU cards so they can share memory, and that this is a distinguishing feature for the architecture to simplify programming. Norrod did say that this Radeon Instinct card was being create din conjunction with key HPC and AI customers like Lawrence Livermore and that it would support all kinds of mixed precision as well as the single and double precision floating point operations that HPC centers require, and that it would also pack a future HBM memory technology. Norrod also said that AMD would be working with Lawrence Livermore to tightly integrate OpenMP into the ROCm programming environment that Oak Ridge will also be helping to widen and deepen on the Frontier system.

All of that extra compute is something that Lawrence Livermore desperately needs because as nuclear weapons in the US stockpile age, we need to run more sophisticated models than can even be done at a reasonable speed on the 150 petaflops Sierra hybrid CPU-GPU system.

“As the nuclear stockpile ages, the complexity of the simulations only increases,” explained de Supinski. “So we need to be able to use larger and larger systems in order to maintain the level of assurance that the nation really needs. And El Capitan, with its significant performance, will meet that need. In particular, it will make it so we can do 3D simulations on a regular basis. So simulations that now require all or a significant portion of Sierra will be able to run routinely, which means that we will be able to have much greater statistical confidence in the results and the model that we use to provide the certification will be more accurate.”
Being a hybrid CPU-GPU machine, there is a temptation to think of El Capitan as Oak Ridge does with its current Summit and future Frontier machines, and that is as an AI-HPC supercomputer. But that is not what Sierra and El Capitan are really about. As Lawrence Livermore explained back in August, not only do the existing nuclear weapons need to be simulated to see if they can work – the Nuclear Test Ban Treaty prevents us from blowing one up to know for sure – but also to completely redesign the nuclear weapons and reuse their nuclear explosives without being able to test them and still know they will work. This is an incredibly massive and difficult set of simulations and designs.

“Our workloads are primarily not deep learning models, although we are exploring something we call cognitive simulation, which brings deep learning and other AI models to bear on our workloads by evaluating how they can accelerate our simulations and how they can also improve their accuracy and find where they actually work,” explained de Supinski. “And so for that, we see this system as providing some significant benefits because of those operations. But I think it’s important to understand that that the primary goal of this system is large scale physics simulation and not deep learning.”





评论
哈哈哈,小伙子看上了廉价货

评论
钱永远都是不够的,能省则省

评论
NVIDIA在计算业的吃相也太难看了一点 被金主门拒绝也是很正常的事情

评论
配图好评

评论

还好吧。。。

评论
转贴机KAG了!

评论
转总已经完全胜利,现在是趁胜追击的时候了

评论
三大超算这就翻车了一台

评论

是牙膏的翻车了?

评论

DOE 的操行就是只要达到要求,谁便宜谁接单,完全不管低下人死活。估计那些用CUDA的人员已经开始撞墙砸桌子了。

评论

一样的,16年公司开始转型自主,以前win+I平台很多积累都放弃了,没办法,大战略,下面的人要么学习要么滚蛋

评论

截屏2020-03-06下午1.16.33.jpg (171.39 KB, 下载次数: 0)

2020-3-6 13:19 上传



有转译的,之前还很原始的时候研究过api层面是完全一模一样的。这其实没问题,cuda的api部分本来就是开源的。

然后绝大部分开发人员做的都是框架上的东西,目前rocm的框架版本号也追上来了,比如前不久tensorflow就发了rocm(2.4)tf 2.0已经非常接近cudnn version,这个甚至不是转译,是原生的厂家实现。

痛苦的是那些底部调优的人员和完全绕开厂家一方库自写底层加速库的,写kernel function的那批,cuda的调优代码全都不能用了。


评论

就算API接口能兼容,感觉不同的需求之间,模型粒度都可能会有很大区别,一点小问题就能卡半天,不知道这么多年过去了,是不是真的抽象得那么好都能适应了。

评论

api一样最后表现的行为不一样很正常,就算是同一个库升级个版本号后让项目垮掉的都比比皆是。但转译只是权衡之技,仅针对遗留代码。

评论

AMD要是可以和伺候微软和索尼一样服务到家还行。可惜米国政府就是孙子一样的存在,哪怕这些系统主要是核项目,还是不如COD的啪啪啪重要。

评论

同样是异构让两家公司做不如让同一家公司做,doe并不是为了省钱而如此,而是考虑到更统一的操作性。将来异构还会被更高级的融合形态代替(放大版的apu,或者巨型规模的soc),doe是门清的。

评论

有钱嘛再重写一下程序就行了。hpc上程序其实也是随便写写。而且hpc上烂优化的程序多了,都是博士生写的,反正力大砖飞。

说不定他们就喜欢amd这种,核心缓存的暴力。 电路 电子 维修 我现在把定影部分拆出来了。想换下滚,因为卡纸。但是我发现灯管挡住了。拆不了。不会拆。论坛里的高手拆解过吗? 评论 认真看,认真瞧。果然有收 电路 电子 维修 求创维42c08RD电路图 评论 电视的图纸很少见 评论 电视的图纸很少见 评论 创维的图纸你要说 版号,不然无能为力 评论 板号5800-p42ALM-0050 168P-P42CLM-01
 ·日本留学生活 求个大阪合租
·日本留学生活 自家房招租求
·日本留学生活 东京地区出9成新lv钱包
·日本育儿教育 孩子从国内过来如何学习日语
·日本育儿教育 明年四月横滨招月嫂
·日本育儿教育 请问咋让娃突破识字关?感谢分享中文共读和学习经验的妈妈
 ·中文新闻 东区明星迈克尔·格列柯,53 岁,将在第一次出生两年后第二次
·中文新闻 《爱情岛》明星卡米拉·瑟洛和杰米·朱维特在透露即将迎来第三

维修经验

CPUcpu-z 1.77版低调发布

日本维修技术更新: New benchmark “submit and compare” feature New clocks dialog reporting all system’s clock speeds in real-time Preliminary support for Intel Kaby Lake AMD Bristol Ridge processors 主要是增加了支持I、A两个新架构的 ...

维修经验

CPU这几天经常开机黑屏,热重启后又正常

日本维修技术这几天经常开机黑屏,热重启后又正常,今天热重启也不管用了。折腾半天总算点亮,显示超频失败,以前出这个画面我是不理它的,直接重启就能正常进系统了,今天不敢托大,因为 ...

维修经验

CPU超频求助!关于华擎H170和6700K

日本维修技术问题见楼主的show贴 https://www.chiphell.com/thread-1634895-1-1.html 这次华擎的H170 Hyper最大的特色应该是自带时钟发生器可以自由超外频 可是楼主好久没有折腾超频了。。。 两图中除了CPU外频 以 ...

维修经验

CPU液态金属会侵蚀cpu核心吗?

日本维修技术前阵子看到有人说,液态金属时间长了会侵蚀cpu铜盖,那么问题来了,这货会不会侵蚀核心呢? 评论 这玩意儿好像只对铝起反应 评论 不是说,cpu的盖子是铜的吗。。。 评论 不会,核 ...

维修经验

CPUm6i究竟支不支持e3 1231v3

日本维修技术官网上看支持列表没写有e3 1231v3,装机帖又有人晒,百度也没个明确答案,那究竟能不能点亮?有在用的chher说一下么 评论 升级最新bios肯定可以支持 评论 我的p67evo官网上也没说支持12 ...

维修经验

CPU华擎 HYPER 妖板 正确玩法

日本维修技术600元的 B150,10相供电,释放洪荒之力 注意必须官网 Beta 区的 BIOS 有 AVX 的 CPU 可能会掉缓存 启动时按 X 键激活 SKY OC,重启后进入 BIOS 160924164727.jpg (95.63 KB, 下载次数: 1) 2016-9-24 17:47 上传 ...

维修经验

CPUE5 2686 V3和i7 6800K如何选择

日本维修技术默认用,不超频,两者功耗是一模一样的 E5 2686 V3:2.0主频,3.5睿频, 18核心36线程 ,45M L3 咸鱼大约2500~3000元 i7 6800K : 3.5主频,3.8睿频 ,6核心12线程 ,盒装3000元 评论 性能应该是26 ...

维修经验

CPUHD530硬解4K能力还是有点弱呀!

日本维修技术播放器用PotPlay 64bit,各种优化后,跑4K @120Hz视频只能到70帧左右的速度,勉强能用! 显示器用的4K的优派VP2780 未标题-1.jpg (211.97 KB, 下载次数: 0) 2016-9-26 21:29 上传 评论 这个估计你没优化 ...

维修经验

CPU6900k 1.25V到4.2体质怎么样

日本维修技术如图,体质怎么样,ring是35,没敢试了,都说ring高了毁硬件 评论 不错的U,但不算雕,上4.4就大雕了,这电压上4.5的目前没见有人发图 评论 谢谢前辈告知 评论 我这个用1.2V超的4.2,R ...

维修经验

CPUI3 6100 华擎B150M pro4超4.5g测试。

日本维修技术看看论坛没多少i3 6100的帖子,就转下自己发的show贴里面的数据,给大家参考下。家里还有当年的神U i3 540 oc 4.5G在给老妈用。 不知道数据上正常吗?有6100的朋友可以告诉下,另外是不有 ...

维修经验

CPU7系u会兼容100系主板吗?

日本维修技术RT,听说要推200系板,100系还能用吗以后。。 评论 兼容的 评论 感谢!以后换u就行了,目前消息200系板会有新的特性吗? 评论 24条PCI-E 3.0通道、支持Intel Optane混合存储技术、十个USB 3 ...

维修经验

CPU有心入5820k了,求教下温度问题

日本维修技术一直徘徊在6700k和5820k之间,6700k现在这德行直接把我推向了5820k啊,从2600k升级上来,三大件都要换,现在唯一疑惑的是IB-E ex这种顶级风冷能不能压住4.5g的5820呢?毕竟刚刚买一个多月。 ...

维修经验

CPU6600&6600K才100的差价

日本维修技术太少了吧。。。 6600.JPG (106.91 KB, 下载次数: 0) 2016-10-1 10:30 上传 评论 毕竟只是i5而已…… 评论 上z170 6600也能超,等于没区别,差价能有100已经不错了 评论 然后又见不超频人士推荐超频 ...