首页 > 最新文献

2020 25th Asia and South Pacific Design Automation Conference (ASP-DAC)最新文献

英文 中文
Thanos: High-Performance CPU-GPU Based Balanced Graph Partitioning Using Cross-Decomposition 灭霸:基于交叉分解的高性能CPU-GPU均衡图分区
Pub Date : 2020-01-01 DOI: 10.1109/ASP-DAC47756.2020.9045588
Dae-Hee Kim, R. Nagi, Deming Chen
As graphs become larger and more complex, it is becoming nearly impossible to process them without graph partitioning. Graph partitioning creates many subgraphs which can be processed in parallel thus delivering high-speed computation results. However, graph partitioning is a difficult task. In this work, we introduce Thanos, a fast graph partitioning tool which uses the cross-decomposition algorithm that iteratively partitions a graph. It also produces balanced loads of partitions. The algorithm is well suited for parallel GPU programming which leads to fast and high-quality graph partitioning solutions. Experimental results show that we have achieved 30x speedup and 35% better edge cut reduction compared to the CPU version of the popular graph partitioner, METIS, on average.
随着图变得越来越大,越来越复杂,如果不进行图分区,几乎不可能对图进行处理。图分区创建了许多子图,这些子图可以并行处理,从而提供高速的计算结果。然而,图划分是一项困难的任务。在这项工作中,我们介绍了Thanos,一个快速的图分区工具,它使用交叉分解算法迭代地划分图。它还产生均衡的分区负载。该算法非常适合并行GPU编程,可实现快速、高质量的图划分解决方案。实验结果表明,与流行的图形分区器METIS的CPU版本相比,我们平均实现了30倍的加速和35%的边缘减少。
{"title":"Thanos: High-Performance CPU-GPU Based Balanced Graph Partitioning Using Cross-Decomposition","authors":"Dae-Hee Kim, R. Nagi, Deming Chen","doi":"10.1109/ASP-DAC47756.2020.9045588","DOIUrl":"https://doi.org/10.1109/ASP-DAC47756.2020.9045588","url":null,"abstract":"As graphs become larger and more complex, it is becoming nearly impossible to process them without graph partitioning. Graph partitioning creates many subgraphs which can be processed in parallel thus delivering high-speed computation results. However, graph partitioning is a difficult task. In this work, we introduce Thanos, a fast graph partitioning tool which uses the cross-decomposition algorithm that iteratively partitions a graph. It also produces balanced loads of partitions. The algorithm is well suited for parallel GPU programming which leads to fast and high-quality graph partitioning solutions. Experimental results show that we have achieved 30x speedup and 35% better edge cut reduction compared to the CPU version of the popular graph partitioner, METIS, on average.","PeriodicalId":125112,"journal":{"name":"2020 25th Asia and South Pacific Design Automation Conference (ASP-DAC)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114978125","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Quantity Evaluation and Reconfiguration Mechanism for Signal- and Power-Interconnections in 3D-Stacking System 三维堆叠系统中信号与电源互连的数量评估与重构机制
Pub Date : 2020-01-01 DOI: 10.1109/ASP-DAC47756.2020.9045169
Ching-Hwa Cheng
Due to the high integration required for system application, the three-dimensional chip may resolve this requirement. The three-dimensional vertically stacking (3D-stacking) systems have been proposed to satisfy these requirements. However, the 3D-stacking system contains several design risks from its long layer interconnections. For a 3D-stacking system, it is difficult to identify where the numerous power and signal-interconnection are open-, shorted-fault, or resistive-short has accrued. Therefore, solving these interconnection problems is necessary. A feasible interconnection quality-evaluation, fault-diagnosis, and connection-reconfigurable mechanism are proposed. The proposed interconnection-measurement-recovery (IMR) mechanism will make it easy to find interconnection faults and make recovery in 3D-Stacking systems. The proposed IMR can detect interconnection open, short, bridge and resistive defects with the path-reroute mechanism. Future more, the signal transmission quality can be measured. This measurement provides to monitor signal propagation in pico-second accuracy. IMR has less extra area and power consumption overhead. The feasibilities of the proposed mechanism have been justified by 2D-chip and 3D-stacking MorPack both systems.
由于系统应用对集成度要求很高,三维芯片可以解决这一要求。为了满足这些要求,提出了三维垂直堆叠(3D-stacking)系统。然而,由于其长层互连,3d堆叠系统存在一些设计风险。对于3d堆叠系统来说,很难识别大量的电源和信号互连在哪里是开路、短路-故障或电阻-短路。因此,解决这些互连问题是必要的。提出了一种可行的互连质量评估、故障诊断和连接可重构机制。所提出的互连-测量-恢复(IMR)机制可以方便地发现三维堆叠系统中的互连故障并进行恢复。利用路径重定向机制,该方法可以检测互连的开路、短路、桥接和电阻性缺陷。未来,还可以测量信号的传输质量。这种测量提供了以皮秒精度监测信号传播。IMR具有较少的额外面积和功耗开销。通过3d芯片和3d堆叠MorPack系统验证了该机制的可行性。
{"title":"A Quantity Evaluation and Reconfiguration Mechanism for Signal- and Power-Interconnections in 3D-Stacking System","authors":"Ching-Hwa Cheng","doi":"10.1109/ASP-DAC47756.2020.9045169","DOIUrl":"https://doi.org/10.1109/ASP-DAC47756.2020.9045169","url":null,"abstract":"Due to the high integration required for system application, the three-dimensional chip may resolve this requirement. The three-dimensional vertically stacking (3D-stacking) systems have been proposed to satisfy these requirements. However, the 3D-stacking system contains several design risks from its long layer interconnections. For a 3D-stacking system, it is difficult to identify where the numerous power and signal-interconnection are open-, shorted-fault, or resistive-short has accrued. Therefore, solving these interconnection problems is necessary. A feasible interconnection quality-evaluation, fault-diagnosis, and connection-reconfigurable mechanism are proposed. The proposed interconnection-measurement-recovery (IMR) mechanism will make it easy to find interconnection faults and make recovery in 3D-Stacking systems. The proposed IMR can detect interconnection open, short, bridge and resistive defects with the path-reroute mechanism. Future more, the signal transmission quality can be measured. This measurement provides to monitor signal propagation in pico-second accuracy. IMR has less extra area and power consumption overhead. The feasibilities of the proposed mechanism have been justified by 2D-chip and 3D-stacking MorPack both systems.","PeriodicalId":125112,"journal":{"name":"2020 25th Asia and South Pacific Design Automation Conference (ASP-DAC)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121234328","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reutilization of Trace Buffers for Performance Enhancement of NoC based MPSoCs 基于NoC的mpsoc性能增强的跟踪缓冲区重用
Pub Date : 2020-01-01 DOI: 10.1109/ASP-DAC47756.2020.9045570
Sidhartha Sankar Rout, M. Badri, Sujay Deb
The contemporary network-on-chips (NoCs) are so complex that capturing all network functional faults at presilicon verification stage is nearly impossible. So, on-chip design-for-debug (DfD) structures such as trace buffers are provided to assist capturing escaped faults during post-silicon debug. Most of the DfD modules are left idle after the debug process. Reuse of such structures can compensate for the area overhead introduced by them. In this work, the trace buffers are reutilized as extended virtual channels for the router nodes of an NoC during in-field execution. Optimal distribution of trace buffers among the routers is performed based upon their load profiling. Experiments with several benchmarks on the proposed architecture show an average of 11.36% increase in network throughput and 13.97% decrease in average delay.
当前的片上网络(noc)非常复杂,在预硅验证阶段捕获所有网络功能故障几乎是不可能的。因此,提供了诸如跟踪缓冲区之类的片上调试设计(DfD)结构,以帮助在硅后调试期间捕获已转义的错误。调试结束后,大部分DfD模块处于空闲状态。这种结构的重用可以补偿它们带来的面积开销。在这项工作中,跟踪缓冲区在现场执行期间被重新利用为NoC的路由器节点的扩展虚拟通道。跟踪缓冲区在路由器之间的最优分布是基于路由器的负载分析。基于该架构的多个基准测试表明,网络吞吐量平均提高了11.36%,平均延迟降低了13.97%。
{"title":"Reutilization of Trace Buffers for Performance Enhancement of NoC based MPSoCs","authors":"Sidhartha Sankar Rout, M. Badri, Sujay Deb","doi":"10.1109/ASP-DAC47756.2020.9045570","DOIUrl":"https://doi.org/10.1109/ASP-DAC47756.2020.9045570","url":null,"abstract":"The contemporary network-on-chips (NoCs) are so complex that capturing all network functional faults at presilicon verification stage is nearly impossible. So, on-chip design-for-debug (DfD) structures such as trace buffers are provided to assist capturing escaped faults during post-silicon debug. Most of the DfD modules are left idle after the debug process. Reuse of such structures can compensate for the area overhead introduced by them. In this work, the trace buffers are reutilized as extended virtual channels for the router nodes of an NoC during in-field execution. Optimal distribution of trace buffers among the routers is performed based upon their load profiling. Experiments with several benchmarks on the proposed architecture show an average of 11.36% increase in network throughput and 13.97% decrease in average delay.","PeriodicalId":125112,"journal":{"name":"2020 25th Asia and South Pacific Design Automation Conference (ASP-DAC)","volume":"183 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125160030","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Designing Efficient Shortcut Architecture for Improving the Accuracy of Fully Quantized Neural Networks Accelerator 为提高全量化神经网络加速器的精度设计高效快捷体系结构
Pub Date : 2020-01-01 DOI: 10.1109/ASP-DAC47756.2020.9045739
Baoting Li, Longjun Liu, Yan-ming Jin, Peng Gao, Hongbin Sun, Nanning Zheng
Network quantization is an effective solution to compress Deep Neural Networks (DNN) that can be accelerated with custom circuit. However, existing quantization methods suffer from significant loss in accuracy. In this paper, we propose an efficient shortcut architecture to enhance the representational capability of DNN between different convolution layers. We further implement the shortcut hardware architecture to effectively improve the accuracy of fully quantized neural networks accelerator. The experimental results show that our shortcut architecture can obviously improve network accuracy while increasing very few hardware resources $( 0.11 times$ and $0.17 times$ for LUT and FF respectively) compared with the whole accelerator.
网络量化是压缩深度神经网络(DNN)的有效解决方案,可以通过定制电路进行加速。然而,现有的量化方法在精度上存在较大的损失。为了提高深度神经网络在不同卷积层之间的表示能力,本文提出了一种高效的快捷结构。进一步实现了快捷硬件架构,有效提高了全量化神经网络加速器的精度。实验结果表明,与整个加速器相比,我们的快捷架构可以明显提高网络精度,同时只增加很少的硬件资源(LUT和FF分别为0.11倍和0.17倍)。
{"title":"Designing Efficient Shortcut Architecture for Improving the Accuracy of Fully Quantized Neural Networks Accelerator","authors":"Baoting Li, Longjun Liu, Yan-ming Jin, Peng Gao, Hongbin Sun, Nanning Zheng","doi":"10.1109/ASP-DAC47756.2020.9045739","DOIUrl":"https://doi.org/10.1109/ASP-DAC47756.2020.9045739","url":null,"abstract":"Network quantization is an effective solution to compress Deep Neural Networks (DNN) that can be accelerated with custom circuit. However, existing quantization methods suffer from significant loss in accuracy. In this paper, we propose an efficient shortcut architecture to enhance the representational capability of DNN between different convolution layers. We further implement the shortcut hardware architecture to effectively improve the accuracy of fully quantized neural networks accelerator. The experimental results show that our shortcut architecture can obviously improve network accuracy while increasing very few hardware resources $( 0.11 times$ and $0.17 times$ for LUT and FF respectively) compared with the whole accelerator.","PeriodicalId":125112,"journal":{"name":"2020 25th Asia and South Pacific Design Automation Conference (ASP-DAC)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115195677","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automated Trigger Activation by Repeated Maximal Clique Sampling 通过重复最大团抽样自动触发激活
Pub Date : 2020-01-01 DOI: 10.1109/ASP-DAC47756.2020.9045449
Yangdi Lyu, P. Mishra
Hardware Trojans are serious threat to security and reliability of computing systems. It is hard to detect these malicious implants using traditional validation methods since an adversary is likely to hide them under rare trigger conditions. While existing statistical test generation methods are promising for Trojan detection, they are not suitable for activating extremely rare trigger conditions in stealthy Trojans. To address the fundamental challenge of activating rare triggers, we propose a new test generation paradigm by mapping trigger activation problem to clique cover problem. The basic idea is to utilize a satisfiability solver to construct a test corresponding to each maximal clique. This paper makes two fundamental contributions: 1) it proves that the trigger activation problem can be mapped to clique cover problem, 2) it proposes an efficient test generation algorithm to activate trigger conditions by repeated maximal clique sampling. Experimental results demonstrate that our approach is scalable and it outperforms state-of-the-art approaches by several orders-of-magnitude in detecting stealthy Trojans.
硬件木马严重威胁计算系统的安全性和可靠性。使用传统的验证方法很难检测到这些恶意植入物,因为攻击者可能会在罕见的触发条件下隐藏它们。虽然现有的统计测试生成方法对木马检测很有希望,但它们不适合激活隐秘木马中极其罕见的触发条件。为了解决激活稀有触发器的基本挑战,我们通过将触发器激活问题映射到团覆盖问题,提出了一个新的测试生成范例。其基本思想是利用一个可满足性解算器来构造一个与每个极大团相对应的测试。本文做出了两个基本贡献:1)证明了触发激活问题可以映射到团覆盖问题,2)提出了一种有效的通过重复最大团采样激活触发条件的测试生成算法。实验结果表明,我们的方法具有可扩展性,并且在检测隐形木马方面优于最先进的方法几个数量级。
{"title":"Automated Trigger Activation by Repeated Maximal Clique Sampling","authors":"Yangdi Lyu, P. Mishra","doi":"10.1109/ASP-DAC47756.2020.9045449","DOIUrl":"https://doi.org/10.1109/ASP-DAC47756.2020.9045449","url":null,"abstract":"Hardware Trojans are serious threat to security and reliability of computing systems. It is hard to detect these malicious implants using traditional validation methods since an adversary is likely to hide them under rare trigger conditions. While existing statistical test generation methods are promising for Trojan detection, they are not suitable for activating extremely rare trigger conditions in stealthy Trojans. To address the fundamental challenge of activating rare triggers, we propose a new test generation paradigm by mapping trigger activation problem to clique cover problem. The basic idea is to utilize a satisfiability solver to construct a test corresponding to each maximal clique. This paper makes two fundamental contributions: 1) it proves that the trigger activation problem can be mapped to clique cover problem, 2) it proposes an efficient test generation algorithm to activate trigger conditions by repeated maximal clique sampling. Experimental results demonstrate that our approach is scalable and it outperforms state-of-the-art approaches by several orders-of-magnitude in detecting stealthy Trojans.","PeriodicalId":125112,"journal":{"name":"2020 25th Asia and South Pacific Design Automation Conference (ASP-DAC)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116454031","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
FTT-NAS: Discovering Fault-Tolerant Neural Architecture FTT-NAS:发现容错神经结构
Pub Date : 2020-01-01 DOI: 10.1109/ASP-DAC47756.2020.9045324
Xuefei Ning, Guangjun Ge, Wenshuo Li, Zhenhua Zhu, Yin Zheng, Xiaoming Chen, Zhen Gao, Yu Wang, Huazhong Yang
With the fast evolvement of deep-learning specific embedded computing systems, applications powered by deep learning are moving from the cloud to the edge. When deploying NNs onto the edge devices under complex environments, there are various types of possible faults: soft errors caused by atmospheric neutrons and radioactive impurities, voltage instability, aging, temperature variations, and malicious attackers. Thus the safety risk of deploying neural networks at edge computing devices in safety-critic applications is now drawing much attention. In this paper, we implement the random bit-flip, Gaussian, and Salt-and-Pepper fault models and establish a multi-objective fault-tolerant neural architecture search framework. On top of the NAS framework, we propose Fault-Tolerant Neural Architecture Search (FT-NAS) to automatically discover convolutional neural network (CNN) architectures that are reliable to various faults in nowadays edge devices. Then we incorporate fault-tolerant training (FTT) in the search process to achieve better results, which we called FTT-NAS. Experiments show that the discovered architecture FT-NAS-Net and FTT-NAS-Net outperform other hand-designed baseline architectures (58.1%/86.6% VS. 10.0%/52.2%), with comparable FLOPs and less parameters. What is more, the architectures trained under a single fault model can also defend against other faults. By inspecting the discovered architecture, we find that there are redundant connections learned to protect the sensitive paths. This insight can guide future fault-tolerant neural architecture design, and we verify it by a modification on ResNet-20–ResNet-M.
随着深度学习专用嵌入式计算系统的快速发展,由深度学习驱动的应用程序正在从云端向边缘移动。在复杂环境下将神经网络部署到边缘设备上时,可能存在各种类型的故障:大气中子和放射性杂质引起的软错误、电压不稳定、老化、温度变化、恶意攻击等。因此,在安全关键应用的边缘计算设备上部署神经网络的安全风险现在引起了人们的广泛关注。在本文中,我们实现了随机位翻转、高斯和盐胡椒故障模型,并建立了一个多目标容错神经结构搜索框架。在NAS框架的基础上,我们提出了容错神经架构搜索(FT-NAS)来自动发现当前边缘设备中可靠的卷积神经网络(CNN)架构。然后我们在搜索过程中加入容错训练(FTT)来获得更好的结果,我们称之为FTT- nas。实验表明,所发现的体系结构FT-NAS-Net和FTT-NAS-Net优于其他手工设计的基准体系结构(58.1%/86.6% VS. 10.0%/52.2%),具有相当的FLOPs和较少的参数。更重要的是,在单一故障模型下训练的体系结构也可以防御其他故障。通过检查发现的结构,我们发现存在冗余连接来保护敏感路径。本文通过对ResNet-20-ResNet-M的修改,验证了这一观点对未来容错神经网络架构设计的指导作用。
{"title":"FTT-NAS: Discovering Fault-Tolerant Neural Architecture","authors":"Xuefei Ning, Guangjun Ge, Wenshuo Li, Zhenhua Zhu, Yin Zheng, Xiaoming Chen, Zhen Gao, Yu Wang, Huazhong Yang","doi":"10.1109/ASP-DAC47756.2020.9045324","DOIUrl":"https://doi.org/10.1109/ASP-DAC47756.2020.9045324","url":null,"abstract":"With the fast evolvement of deep-learning specific embedded computing systems, applications powered by deep learning are moving from the cloud to the edge. When deploying NNs onto the edge devices under complex environments, there are various types of possible faults: soft errors caused by atmospheric neutrons and radioactive impurities, voltage instability, aging, temperature variations, and malicious attackers. Thus the safety risk of deploying neural networks at edge computing devices in safety-critic applications is now drawing much attention. In this paper, we implement the random bit-flip, Gaussian, and Salt-and-Pepper fault models and establish a multi-objective fault-tolerant neural architecture search framework. On top of the NAS framework, we propose Fault-Tolerant Neural Architecture Search (FT-NAS) to automatically discover convolutional neural network (CNN) architectures that are reliable to various faults in nowadays edge devices. Then we incorporate fault-tolerant training (FTT) in the search process to achieve better results, which we called FTT-NAS. Experiments show that the discovered architecture FT-NAS-Net and FTT-NAS-Net outperform other hand-designed baseline architectures (58.1%/86.6% VS. 10.0%/52.2%), with comparable FLOPs and less parameters. What is more, the architectures trained under a single fault model can also defend against other faults. By inspecting the discovered architecture, we find that there are redundant connections learned to protect the sensitive paths. This insight can guide future fault-tolerant neural architecture design, and we verify it by a modification on ResNet-20–ResNet-M.","PeriodicalId":125112,"journal":{"name":"2020 25th Asia and South Pacific Design Automation Conference (ASP-DAC)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129739416","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
Design for EM Side-Channel Security through Quantitative Assessment of RTL Implementations 通过RTL实现的定量评估来设计EM侧信道安全性
Pub Date : 2020-01-01 DOI: 10.1109/ASP-DAC47756.2020.9045426
Jiaji He, Haocheng Ma, Xiaolong Guo, Yiqiang Zhao, Yier Jin
Electromagnetic (EM) side-channel attacks aim at extracting secret information from cryptographic hardware implementations. Countermeasures have been proposed at device level, register-transfer level (RTL) and layout level, though efficient, there are still requirements for quantitative assessment of the hardware implementations’ resistance against EM side-channel attacks. In this paper, we propose a design for EM side-channel security evaluation and optimization framework based on the t-test evaluation results derived from RTL hardware implementations. Different implementations of the same cryptographic algorithm are evaluated under different hypothesis leakage models considering the driven capabilities of logic components, and the evaluation results are validated with side-channel attacks on FPGA platform. Experimental results prove the feasibility of the proposed side-channel leakage evaluation method at pre-silicon stage. The remedies and suggested security design rules are also discussed.
电磁(EM)侧信道攻击的目的是从加密硬件实现中提取秘密信息。在器件级、寄存器-传输级(RTL)和布局级提出了对策,虽然有效,但仍然需要定量评估硬件实现对EM侧信道攻击的抵抗能力。在本文中,我们提出了一个基于RTL硬件实现的t检验评估结果的EM侧信道安全评估和优化框架设计。考虑到逻辑组件的驱动能力,在不同的假设泄漏模型下对同一密码算法的不同实现进行了评估,并在FPGA平台上利用侧信道攻击对评估结果进行了验证。实验结果证明了该方法在预硅阶段的可行性。还讨论了补救措施和建议的安全设计规则。
{"title":"Design for EM Side-Channel Security through Quantitative Assessment of RTL Implementations","authors":"Jiaji He, Haocheng Ma, Xiaolong Guo, Yiqiang Zhao, Yier Jin","doi":"10.1109/ASP-DAC47756.2020.9045426","DOIUrl":"https://doi.org/10.1109/ASP-DAC47756.2020.9045426","url":null,"abstract":"Electromagnetic (EM) side-channel attacks aim at extracting secret information from cryptographic hardware implementations. Countermeasures have been proposed at device level, register-transfer level (RTL) and layout level, though efficient, there are still requirements for quantitative assessment of the hardware implementations’ resistance against EM side-channel attacks. In this paper, we propose a design for EM side-channel security evaluation and optimization framework based on the t-test evaluation results derived from RTL hardware implementations. Different implementations of the same cryptographic algorithm are evaluated under different hypothesis leakage models considering the driven capabilities of logic components, and the evaluation results are validated with side-channel attacks on FPGA platform. Experimental results prove the feasibility of the proposed side-channel leakage evaluation method at pre-silicon stage. The remedies and suggested security design rules are also discussed.","PeriodicalId":125112,"journal":{"name":"2020 25th Asia and South Pacific Design Automation Conference (ASP-DAC)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123829589","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Improved DD-based Equivalence Checking of Quantum Circuits 改进的基于dd的量子电路等效性检验
Pub Date : 2020-01-01 DOI: 10.1109/ASP-DAC47756.2020.9045153
Lukas Burgholzer, R. Wille
Quantum computing is gaining considerable momentum through the recent progress in physical realizations of quantum computers. This led to rather sophisticated design flows in which the originally specified quantum functionality is compiled through different abstractions. This increasingly raises the question whether the respectively resulting quantum circuits indeed realize the originally intended function. Accordingly, efficient methods for equivalence checking are gaining importance. However, existing solutions still suffer from significant shortcomings such as their exponential worst case performance and an increased effort to obtain counterexamples in case of non-equivalence. In this work, we propose an improved DD-based equivalence checking approach which addresses these shortcomings. To this end, we utilize decision diagrams and exploit the fact that quantum operations are inherently reversible - allowing for dedicated strategies that keep the overhead moderate in many cases. Experimental results confirm that the proposed strategies lead to substantial speed-ups – allowing to perform equivalence checking of quantum circuits factors or even magnitudes faster than the state of the art.
通过量子计算机物理实现的最新进展,量子计算正在获得相当大的动力。这导致了相当复杂的设计流程,其中最初指定的量子功能通过不同的抽象来编译。这越来越多地提出了一个问题,即分别产生的量子电路是否确实实现了最初预期的功能。因此,有效的等价检验方法变得越来越重要。然而,现有的解决方案仍然存在明显的缺点,例如它们的指数最坏情况性能以及在非等价情况下获得反例的工作量增加。在这项工作中,我们提出了一种改进的基于dd的等效性检查方法来解决这些缺点。为此,我们利用决策图并利用量子操作固有可逆的事实-允许在许多情况下保持开销适中的专用策略。实验结果证实,所提出的策略导致了实质性的加速-允许执行量子电路因素的等效检查,甚至比目前的技术水平更快。
{"title":"Improved DD-based Equivalence Checking of Quantum Circuits","authors":"Lukas Burgholzer, R. Wille","doi":"10.1109/ASP-DAC47756.2020.9045153","DOIUrl":"https://doi.org/10.1109/ASP-DAC47756.2020.9045153","url":null,"abstract":"Quantum computing is gaining considerable momentum through the recent progress in physical realizations of quantum computers. This led to rather sophisticated design flows in which the originally specified quantum functionality is compiled through different abstractions. This increasingly raises the question whether the respectively resulting quantum circuits indeed realize the originally intended function. Accordingly, efficient methods for equivalence checking are gaining importance. However, existing solutions still suffer from significant shortcomings such as their exponential worst case performance and an increased effort to obtain counterexamples in case of non-equivalence. In this work, we propose an improved DD-based equivalence checking approach which addresses these shortcomings. To this end, we utilize decision diagrams and exploit the fact that quantum operations are inherently reversible - allowing for dedicated strategies that keep the overhead moderate in many cases. Experimental results confirm that the proposed strategies lead to substantial speed-ups – allowing to perform equivalence checking of quantum circuits factors or even magnitudes faster than the state of the art.","PeriodicalId":125112,"journal":{"name":"2020 25th Asia and South Pacific Design Automation Conference (ASP-DAC)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114553339","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
Capacitance Extraction and Power Grid Analysis Using Statistical and AI Methods 利用统计和人工智能方法进行电容提取和电网分析
Pub Date : 2020-01-01 DOI: 10.1109/ASP-DAC47756.2020.9045462
Wenjian Yu, Ming Yang, Yao Feng, Ganqu Cui, B. Gu
Capacitance extraction and power grid (PG) analysis for IC design involve large-scale numerical simulation problems. As the process technology becomes more complicated and design margin is shrinking, the capacitance field solver and power-grid matrix solver with high accuracy and capability for handing large and complex structure are highly demanded. In this invited paper, we present recent application of statistical and AI methods in these two fields. The Markov-chain model and relevant analysis are presented for developing an efficient technique for handling conformal dielectrics in the floating random walk based capacitance extraction. Then, two approaches reducing the computational cost of a domain decomposition based power-grid solver are presented. One employs supervised machine learning while the other is inspired by the A*-search algorithm.
集成电路设计中的电容提取和电网分析涉及大规模数值模拟问题。随着工艺技术的复杂化和设计余量的不断缩小,对具有高精度和处理大型复杂结构能力的电容场求解器和电网矩阵求解器提出了很高的要求。在这篇特邀论文中,我们介绍了统计和人工智能方法在这两个领域的最新应用。为了在基于浮动随机游动的电容提取中有效地处理保形介质,提出了马尔可夫链模型和相关分析。然后,提出了两种降低基于域分解的电网求解器计算量的方法。一个采用监督式机器学习,而另一个则受到A*搜索算法的启发。
{"title":"Capacitance Extraction and Power Grid Analysis Using Statistical and AI Methods","authors":"Wenjian Yu, Ming Yang, Yao Feng, Ganqu Cui, B. Gu","doi":"10.1109/ASP-DAC47756.2020.9045462","DOIUrl":"https://doi.org/10.1109/ASP-DAC47756.2020.9045462","url":null,"abstract":"Capacitance extraction and power grid (PG) analysis for IC design involve large-scale numerical simulation problems. As the process technology becomes more complicated and design margin is shrinking, the capacitance field solver and power-grid matrix solver with high accuracy and capability for handing large and complex structure are highly demanded. In this invited paper, we present recent application of statistical and AI methods in these two fields. The Markov-chain model and relevant analysis are presented for developing an efficient technique for handling conformal dielectrics in the floating random walk based capacitance extraction. Then, two approaches reducing the computational cost of a domain decomposition based power-grid solver are presented. One employs supervised machine learning while the other is inspired by the A*-search algorithm.","PeriodicalId":125112,"journal":{"name":"2020 25th Asia and South Pacific Design Automation Conference (ASP-DAC)","volume":"15 5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126231293","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Soft Error and Its Countermeasures in Terrestrial Environment 陆地环境下的软误差及其对策
Pub Date : 2020-01-01 DOI: 10.1109/ASP-DAC47756.2020.9045161
M. Hashimoto, Wang Liao
This paper discusses soft errors in digital chips consisting of SRAM, flip-flops, and combinational logic in the terrestrial environment. We review the effectiveness of error-correction coding (ECC) in processor systems and point out the importance of radiation-hardened flip-flops for further error mitigation. The discussion includes the difference between planar and FD-SOI transistors, and the type of secondary cosmic rays including neutron and muon, using irradiation test results. Also, the difficulty in characterizing SER of a commercial GPU chip is exemplified.
本文讨论了由SRAM、触发器和组合逻辑组成的数字芯片在地面环境中的软误差。我们回顾了纠错编码(ECC)在处理器系统中的有效性,并指出抗辐射触发器对进一步降低错误的重要性。讨论了平面型和FD-SOI型晶体管的区别,以及二次宇宙射线的类型,包括中子和介子。此外,还举例说明了商用GPU芯片SER特性的困难。
{"title":"Soft Error and Its Countermeasures in Terrestrial Environment","authors":"M. Hashimoto, Wang Liao","doi":"10.1109/ASP-DAC47756.2020.9045161","DOIUrl":"https://doi.org/10.1109/ASP-DAC47756.2020.9045161","url":null,"abstract":"This paper discusses soft errors in digital chips consisting of SRAM, flip-flops, and combinational logic in the terrestrial environment. We review the effectiveness of error-correction coding (ECC) in processor systems and point out the importance of radiation-hardened flip-flops for further error mitigation. The discussion includes the difference between planar and FD-SOI transistors, and the type of secondary cosmic rays including neutron and muon, using irradiation test results. Also, the difficulty in characterizing SER of a commercial GPU chip is exemplified.","PeriodicalId":125112,"journal":{"name":"2020 25th Asia and South Pacific Design Automation Conference (ASP-DAC)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128050878","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
期刊
2020 25th Asia and South Pacific Design Automation Conference (ASP-DAC)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1