

Academic Seminar

Efficiently Running AI Workloads Using Long SIMD and Matrix ISAs


Speaker: Marc Casas Guix, Barcelona Supercomputing Center

Time: October 7, 2024, 9:30-11:30

Venue: Main Building, Room B1421

Host: Liu Weifeng (劉偉峰)


Speaker Biography:

Marc Casas is a technical research lead at the Barcelona Supercomputing Center (BSC) and a lecturer at the Universitat Politècnica de Catalunya (UPC). His research lies between computer architecture (e.g., memory address translation and vector architectures) and high-performance computing (e.g., sparse linear algebra and parallel deep learning). He is the technical lead of the SONAR (parallel SOftware and New ARchitectures) research group, composed of PhD students, engineers, and postdocs. Marc has led BSC contributions to several European projects (Mont-Blanc 2020, European Processor Initiative, etc.) and research collaborations with Intel and IBM.

Marc has been at BSC since 2013. He was a postdoctoral research scholar at the Lawrence Livermore National Laboratory (LLNL) from 2010 to 2013. He received the Marie Curie and Ramón y Cajal Fellowships in 2014 and 2018, respectively. He obtained a 5-year degree in mathematics in 2004 and a PhD degree in Computer Science in 2010 from the Universitat Politècnica de Catalunya (UPC).


Abstract:

This talk will show how state-of-the-art proposals to compute convolutions on architectures with CPUs supporting SIMD instructions deliver poor performance for long SIMD lengths due to frequent cache conflict misses. The talk will propose new algorithmic approaches to mitigate the limitations of state-of-the-art proposals via the adaptation of the amount of computation exposed to the microarchitecture to mitigate cache misses, and the redefinition of the activation memory layout to improve the memory access pattern. These algorithmic approaches will motivate the Matrix Tile Extension (MTE), a novel matrix Instruction-Set Architecture (ISA) that completely decouples the instruction set architecture from the microarchitecture and seamlessly interacts with existing vector ISAs. MTE incurs minimal implementation overhead since it only requires a few additional instructions and a 64-bit Control Status Register (CSR) to keep its state, and beats the best state-of-the-art matrix ISA by 1.20x.

