Cantitate/Preț
Produs

Programming Massively Parallel Processors: A Hands-on Approach

Autor Wen-Mei W. Hwu, David B. Kirk, Izzat El Hajj
en Limba Engleză Paperback – sep 2026
Programming Massively Parallel Processors: A Hands-on Approach Fifth Edition shows both students and professionals alike the basic concepts of parallel programming and GPU architecture. Concise, intuitive, and practical, it is based on years of road-testing in the authors' own parallel computing courses. Various techniques for constructing and optimizing parallel programs are explored in detail, while case studies demonstrate the development process, which begins with computational thinking and ends with effective and efficient parallel programs. This new edition has been updated with an expanded repertoire of optimizations, new patterns and applications, ad more coverage of important CUDA features.

· Expanded optimization checklist with a more comprehensive demonstration of essential optimizations across patterns
· New pattern and application chapters including: filtering, wavefront parallelism, advanced optimizations for matrix multiplication, and large language models (LLMs)
· More coverage of important CUDA features including warp-level programming, cooperative groups, CUDA C++ atomics, and multi-GPU programming with NCCL and NVSHMEM
Citește tot Restrânge

Toate formatele și edițiile

Toate formatele și edițiile Preț Express
Paperback (2) 40206 lei  2-4 săpt. +18813 lei  5-11 zile
  ELSEVIER SCIENCE – 23 sep 2022 40206 lei  2-4 săpt. +18813 lei  5-11 zile
  ELSEVIER SCIENCE – sep 2026 41749 lei  Precomandă

Preț: 41749 lei

Preț vechi: 52187 lei
-20% Precomandă

Puncte Express: 626

Preț estimativ în valută:
7383 8466$ 6380£

Carte nepublicată încă

Doresc să fiu notificat când acest titlu va fi disponibil:

Specificații

ISBN-13: 9780443439001
ISBN-10: 0443439001
Pagini: 680
Dimensiuni: 191 x 235 mm
Ediția:5
Editura: ELSEVIER SCIENCE

Cuprins

1. Introduction

Part I. Fundamental Concepts
2. Heterogeneous data parallel computing
3. Multidimensional grids and data
4. Compute architecture and scheduling
5. Memory architecture and data locality
6. Performance considerations

Part II. Parallel Patterns
7. Convolution
8. Stencil
9. Parallel histogram
10. Reduction
11. Prefix sum (scan)
12. Merge

Part III. Advanced Patterns and Applications
13. Sorting
14. Filtering (new)
15. Sparse matrix computation
16. Wavefront Algorithms (new)
17. Graph traversal
18. Deep learning
19. Multi-GPU API (new)
20. Electrostatic potential map
21. Parallel programming and computational thinking

Part IV. Advanced Practices
22. Programming a heterogeneous computing cluster
23. Advanced Optimizations for Matrix Multiplication (new)
24. Advanced practices and future evolution
25. Conclusion and outlook