计算机体系结构

John L. Hennessy,David A. Patterson 机械工业出版社

出版时间：

2012-1

出版社：

机械工业出版社

作者：

John L. Hennessy,David A. Patterson

Tag标签：

无

内容概要

　　《计算机体系结构：量化研究方法（英文版·第5版）》堪称计算机系统结构学科的“圣经”，是计算机设计领域学生和实践者的必读经典。本书系统地介绍了计算机系统的设计基础、存储器层次结构设计、指令级并行及其开发、数据级并行、GPU体系结构、线程级并行和仓库级计算机等。　　现今计算机界处于变革之中：移动客户端和云计算正在成为驱动程序设计和硬件创新的主流范型。因此在这个最新版中，作者考虑到这个巨大的变化，重点关注了新的平台（个人移动设备和仓库级计算机）和新的体系结构（多核和GPU），不仅介绍了移动计算和云计算等新内容，还讨论了成本、性能、功耗、可靠性等设计要素。每章都有两个真实例子，一个来源于手机，另一个来源于数据中心，以反映计算机界正在发生的革命性变革。　　本书内容丰富，既介绍了当今计算机体系结构的最新研究成果，也引述了许多计算机系统设计开发方面的实践经验。另外，各章结尾还附有大量的习题和参考文献。本书既可以作为高等院校计算机专业高年级本科生和研究生学习“计算机体系结构”课程的教材或参考书，也可供与计算机相关的专业人士学习参考。

作者简介

　　John L. Hennessy
斯坦福大学校长，IEEE和ACM会士，美国国家工程研究院院士及美国科学艺术研究院院士。Hennessy教授因为在RISC技术方面做出了突出贡献而荣获2001年的Eckert-Mauchly奖章，他也是2001年Seymour
Cray计算机工程奖得主，并且和本书另外一位作者David A. Patterson分享了2000年John von
Neumann奖。
　　David A. Patterson
加州大学伯克利分校计算机科学系主任、教授，美国国家工程研究院院士，IEEE和ACM会士，曾因成功的启发式教育方法被IEEE授予James
H. Mulligan，Jr.教育奖章。

书籍目录

ForewordPrefaceAcknowledgmentsChapter 1 Fundamentals of Quantitative Design and Analysis1.1 Introduction1.2 Classes of Computers1.3 Defining Computer Architecture1.4 Trends in Technology1.5 Trends in Power and Energy in Integrated Circuits1.6 Trends in Cost1.7 Dependability1.8 Measuring, Reporting, and Summarizing Performance1.9 Quantitative Principles of Computer Design1.10 Putting It All Together: Performance, Price, and Power1.11 Fallacies and Pitfalls1.12 Concluding Remarks1.13 Historical Perspectives and References Case Studies and Exercises by Diana FranklinChapter 2 Memory Hierarchy Design2.1 Introduction2.2 Ten Advanced Optimizations of Cache Performance2.3 Memory Technology and Optimizations2.4 Protection: Virtual Memory and Virtual Machines2.5 Crosscutting Issues: The Design of Memory Hierarchies2.6 Putting It All Together: Memory Hierachies in the ARM Cortex-AS and Intel Core i72.7 Fallacies and Pitfalls2.8 Concluding Remarks: Looking Ahead2.9 Historical Perspective and References Case Studies and Exercises by Norman P. Jouppi, Naveen Muralimanohar, and Sheng LiChapter 3 nstruction-Level Parallelism and Its Exploitation3.1 Instruction-Level Parallelism: Concepts and Challenges3.2 Basic Compiler Techniques for Exposing ILP3.3 Reducing Branch Costs with Advanced Branch Prediction3.4 Overcoming Data Hazards with Dynamic Scheduling3.5 Dynamic Scheduling: Examples and the Algorithm3.6 Hardware-Based Speculation3.7 Exploiting ILP Using Multiple Issue and Static Scheduling3.8 Exploiting ILP Using Dynamic Scheduling, Multiple Issue, and Speculation3.9 Advanced Techniques for Instruction Delivery and Speculation3.10 Studies of the Limitations oflLP3.11 Cross-Cutting Issues: ILP Approaches and the Memory System3.12 Multithreading: Exploiting Thread-Level Parallelism to Improve Uniprocessor Throughput3.13 Putting It All Together: The Intel Core i7 and ARM Cortex-AS3.14 Fallacies and Pitfalls3.15 Concluding Remarks: What's Ahead?3.16 Historical Perspective and References Case Studies and Exercises by Jason D. Bakos and Robert R ColwellChapter4 Data-Level Parallelism in Vector, SIMD, and GPU Architectures4.1 Introduction4.2 Vector Architecture4.3 SIMD Instruction Set Extensions for Multimedia4.4 Graphics Processing Units4.5 Detecting and Enhancing Loop-Level Parallelism4.6 Crosscutting Issues4.7 Putting It All Together: Mobile versus Server GPUS and Tesla versus Core i74.8 Fallacies and Pitfalls4.9 Concluding Remarks4.10 Historical Perspective and References Case Study and Exercises by Jason D. BakosChapter 5 Thread-Level Parallelism5.1 Introduction5.2 Centralized Shared-Memory Architectures5.3 Performance of Symmetric Shared-Memory Multiprocessors……Chapter6 Warehouse-Scale Computers to Exploit Request-Level and Data-Level ParallelismAppendix A Instruction Set PrinciplesAppendix B Review of Memory HierarchyAppendix C Pipelining: Basic and Intermediate Concepts

章节摘录

版权页：插图：The pressure of both a fast clock cycle and power limitations encourages limited size for first-level caches. Similarly, use of lower levels of associativity can reduce both hit time and power, although such trade-offs are more complex than those involving size.The critical timing path in a cache hit is the three-step process of addressing the tag memory using the index portion of the address, comparing the read tag value to the address, and setting the multiplexor to choose the correct data item if the cache is set associative. Direct-mapped caches can overlap the tag check with the transmission of the data, effectively reducing hit time. Furthermore, lower levels of associativity will usually reduce power because fewer cache lines must be accessed.Although the total amount of on-chip cache has increased dramatically with new generations of microprocessors, due to the clock rate impact arising from a larger L1 cache, the size of the L1 caches has recently increased either slightly or not at all. In many recent processors, designers have opted for more associativity rather than larger caches. An additional consideration in choosing the associativity is the possibility of eliminating address aliases; we discuss this shortly.One approach to determining the impact on hit time and power consumption in advance of building a chip is to use CAD tools. CACTI is a program to estimate the access time and energy consumption of alternative cache structures on CMOS microprocessors within 10% of more detailed CAD tools. For a given minimum feature size, CACTI estimates the hit time of caches as cache size varies, associativity, number of read/write ports, and more complex parameters. Figure 2.3 shows the estimated impact on hit time as cache size and associativity are varied.

媒体关注与评论

“本书之所以成为永恒的经典，是因为它的每一次再版都不仅仅是更新补充，而是一次全面的修订，对这个激动人心且快速变化领域给出了最及时的信息和最独到的解读。对于我来说,即使已有二十多年的从业经历，再次阅读本书仍自觉学无止境,感佩于两位卓越大师的渊博学识和深厚功底。” ——Luiz Andre Barroso，Google公司

编辑推荐

《计算机体系结构:量化研究方法(英文版•第5版)》特色：更新相关内容以覆盖移动计算变革，并强调当今体系结构中最重要的两个主题：存储器层次结构和各种并行技术。每章中的“Putting It All Together”小节关注了业界的各种最新技术，包括ARM Cortex-A8、Intel Core i7、NVIDIAGTX-280和GTX-480 GPU，以及一种Google仓库级计算机。每章都设计了常规主题：能力、性能、成本、可依赖性、保护、编程模型和新趋势。书中包括3个附录，另外8个附录可以在原出版社网站在线得到。每章最后都设置了由工业界和学术界专家提供的经过更新的案例研究，以及与之配套的全新练习题。

图书封面

图书标签Tags

无

下载页面

计算机体系结构 PDF格式下载

每天多看你一眼，了解就多一分。第一次见识loop unrolling是在programming pearls上面，然后就很多次见到这个词，这本书给出了更多解释。随着Microsoft, google, Facebook等大量集群系统出现，他们不采用传统数据库系统，而是自行发展自己的软件平台，最后一章对这些进行了精彩的阐述。至于其余:ILP 本书一直以来保留项目,存储器层次结构保留项目,TLP 多处理器时代的主题,DLP 混合架构CPU+GPU嗯，期待能更深刻地理解计算机的工作方式

看了一章，感觉很不错。不过亚马逊发的书有些脏，机械工业出版社的标签被撕掉一半，看起来不像新书。

很厚实，里面看了下还不错，就是教科书类型

书很不错，我们需要的就是这个英文版的

绝对是计算机体系结构方面的圣经啊，书的质量没有原版的好，但是还是不错了，比原版便宜了不少。

英文原版计算机教材，对计算机体系结构有很好的阐述

买的这本书，价格不算便宜。可是书的质量实在不敢恭维。里面有倒页，而且不止一处。我向亚马逊反映，也没有给我满意的答复。如果说书的印刷装订质量有问题，亚马逊没办法保证，那是印刷厂出版社的责任。但如果再有卖家的服务问题，那真是无语了。建议大家买前考虑清楚。

体系结构领域的圣经，还有什么好说呢。

经典之作，值得慢慢读

还没看，不过书的包装和质量还是感觉不错的。希望内容能有一些惊喜

计算机体系结构的圣经。

内容很好，就是关键的词看起来吃力一些

体系结构经典，买过好几版，内容不断在与时俱进。

这本书浅显易懂。块头很大，是因为语言组织的很容易理解！想深入理解计算机结构的人可以好好钻研下这本书！

趁活动买的，经典书籍，就不多评论了

书的印刷还好，书里的插图丰富，每页都留有足够空白位置做笔记，但为什么不像CSAPP那样双色印刷呢？就算贵一点也没关系

经典著作，无需多言。

厚厚一本，经典的书籍，讲的很清晰

还是第三版好些

希望有用处。。。。。。。。。。

经典教材，体系结构专业必备

经典书籍，值得看看

计算机体系结构方面的经典之作

经典书籍，正在研读中

第一图书网

计算机体系结构

相关图书