Cantitate/Preț
Produs

Multimodal Artificial Intelligence and Large Language Models: A Comprehensive Guide from Theory to Practice

Editat de L Ashok Kumar, D Karthika Renuka, Tanvi Banerjee, C. Deisy
en Limba Engleză Hardback – 29 sep 2026
The book provides a comprehensive technical analysis of multimodal artificial intelligence systems and implementation frameworks. It offers thorough coverage of cross-modal processing methods for use, including speech recognition and automatic image captioning.
  • It presents a detailed discussion of architecture for integrating text, image, audio, and video modalities, cross-modal processing pipelines, and data fusion techniques.
  • Showcases real-time synchronization mechanisms across different modalities and scalable design patterns for multimodal systems.
  • Discusses multimodal emotion recognition using deep Learning techniques, focusing on recent advancements, challenges, and ethical considerations.
  • Investigates deployment optimization strategies to address issues with latency, resource usage, and scalability of multimodal systems.
  • Focuses on techniques for performance optimization, memory management, and distributed processing for multimodal workloads using frameworks like PyTorch and TensorFlow.
The text is primarily written for senior undergraduates, graduate students, and academic researchers in electrical engineering, electronics and communications engineering, computer science and engineering, and information technology. 
Citește tot Restrânge

Preț: 86539 lei

Preț vechi: 115686 lei
-25% Precomandă

Puncte Express: 1298

Carte nepublicată încă

Doresc să fiu notificat când acest titlu va fi disponibil:

Specificații

ISBN-13: 9781041152132
ISBN-10: 1041152132
Pagini: 376
Ilustrații: 178
Dimensiuni: 156 x 234 mm
Ediția:1
Editura: CRC Press
Colecția CRC Press

Public țintă

Academic, Postgraduate, and Undergraduate Advanced

Cuprins

Part 1: Emerging Multimodal Artificial Intelligence and Innovations. 1. Introduction to Multimodal Large Language Models. 2. Integration of Large Language Models for Conversational Artificial Intelligence. 3. Navigating Complexity: Challenges and Limitations in Multimodal Artificial Intelligence Models. 4. Integration of Large Language Models for Conversation Artificial Intelligence, and Multimodal Conversational Artificial Intelligence. 5. Privacy and Data Security Concerns in SPAN (Self-Organizing Pervasive Ad-hoc Network) for Multimodal Artificial Intelligence. 6. The Role of Generative Artificial Intelligence in Shaping Multimodal Experiences. 7. Enhancing Privacy and Data Security in Multimodal Large Language Models through Cryptography and Blockchain Technology. Part 2: Global Case Studies and Applications. 8. Applications of Multimodal Artificial Intelligence: Bridging Modalities for Enhanced Intelligence. 9. Multimodal Emotion Recognition with Deep Learning. 10. Bridging Modalities: A Comprehensive Approach to Emotion Recognition. 11. Real-Time Sign Language Recognition and Grammatically Correct, Coherent Sentence Formation Using Deep Learning Techniques. 12. Emotion Detection Across Modalities: A Deep Dive into Multimodal Systems. 13. Multimodal Disentangled Representation Learning for Enhanced User Behavior Analysis in Recommendation Systems. 14. Generative AI in Multimodal Biological Data: Transformations, Techniques, and Future Directions.

Notă biografică

L. Ashok Kumar is Principal at Thiagarajar College of Engineering, Madurai, Tamil Nadu, India. He was a Postdoctoral Research Fellow from San Diego State University, California. He has three years of industrial experience and twenty-three years of academic and research experience. He has published 173 technical papers in International and National journals and presented 167 papers in National and International Conferences. He has developed 27 products, and out of those, 23 products have been technology transferred to industries and for Government funding agencies. His areas of interest include wearable electronics, renewable energy systems, power electronics and drives, and smart grids.D. Karthika Renuka is a Professor in the Department of Information Technology at PSG College of Technology, India. Her professional career of 20 years has been with PSG College of Technology since 2003. She is an Associate Dean (Students Welfare) and a convenor for the Students’ Welfare Committee at PSG College of Technology. She was a Postdoctoral Research Fellow from Wright State University, Ohio, USA. Her area of specialization includes data mining, evolutionary algorithms, soft computing, machine learning, deep learning, affective computing, and computer vision. She has published papers in reputable National and International journals and conferences.Tanvi Banerjee is an Associate Professor in the Department of Computer Science and Engineering at Wright State University, USA, and has a secondary appointment at the Department of Geriatrics at the Boonshoft School of Medicine, Wright State University, USA. Her academic focus has been at the crossroads of artificial intelligence and healthcare, revolving around multimodal data fusion, wearable sensing, and mobile healthcare technologies. She has been a recipient of the prestigious K01 grant awarded by NIH (equivalent to the CAREER pathway in NSF) for the project on Dementia Management using smartphone technologies and a co-investigator in the NIH R01-funded Sickle Cell Disease project. Deisy Chelliah is currently a Professor and the Head of the Information Technology Department at Thiagarajar College of Engineering in Madurai, Tamil Nadu, India. She has twenty-four years of teaching experience. Her areas of interest include machine learning, artificial intelligence, natural language processing, and data mining. She has published more than 90 research articles in journals and conferences.

Descriere

The text presents research trends and challenges in developing multimodal artificial intelligence applications and helps in designing interactive applications such as chatbots, text generation, sentiment analysis, entity recognition, and language translation.