LIBRISTO
LIBROAMANTO
mandatory
Become part of a community of book lovers from all over the world and get access to a whole bunch of benefits. Create an account for free
0
DPD courier 4.99 GLS courier 9.99

AI Engineering

Building Multi-Modal Intelligent Systems with Vision, Language, and Audio From LLM Fine-Tuning to Voice Agents, AR Interfaces, and Real-World Deployment

Language EnglishEnglish
Book Paperback
Book AI Engineering Husn Ara
Libristo code: 50705556
Publishers Independently published, August 2025
AI Engineering: Building Multi-Modal Intelligent Systems with Vision, Language, and AudioFrom LLM Fi... Full description
? points 81 b
33.48
In stock at our supplier Shipping in 9-15 days

Up to 30 days for returns


Customers also purchased


AI Engineering: Building Multi-Modal Intelligent Systems with Vision, Language, and Audio
From LLM Fine-Tuning to Voice Agents, AR Interfaces, and Real-World Deployment

Unlock the future of artificial intelligence with practical, production-ready multi-modal engineering.

This hands-on guide is built for developers, researchers, and AI professionals who want to go beyond chatbots and dive into building intelligent systems that understand text, images, audio, and human intent - all in one pipeline.

Whether you're fine-tuning large language models (LLMs) or creating voice-driven AR interfaces, this book walks you through the real engineering decisions, tools, and architectures needed to bring multi-modal AI to life.


What You'll Learn:
  • Fine-tuning Large Language Models (LLMs): Train and adapt models like GPT-2, LLaMA, and Mistral for custom tasks using Hugging Face, LoRA, QLoRA, and PEFT.

  • Voice Interfaces: Combine Whisper, LLMs, and Bark/Tortoise TTS to build interactive speech-driven assistants.

  • Computer Vision + Language: Use models like BLIP, CLIP, and DETR to connect what systems see to what they say and understand.

  • Instruction Tuning & Hyperparameter Optimization: Build smarter, domain-specific models with efficient training workflows.

  • Multi-Modal Pipelines: Chain audio, image, and text inputs for question answering, summarization, tutoring, and AR/robotic control.

  • Real-Time Interfaces: Deploy intelligent agents using FastAPI, Streamlit, Gradio, Docker, and Hugging Face Spaces.

  • Edge & Offline Deployment: Optimize models with ONNX, quantization (4-bit, 8-bit), and TensorRT for low-latency inference on CPU/GPU.


Use Cases Covered:
  • Smart document summarizers with OCR + TTS

  • Voice-enabled image assistants

  • Emotion-aware agents

  • Virtual tutors

  • AR-enhanced AI interfaces

  • Robotic perception + control from voice/image input

  • Secure, multilingual, and privacy-conscious AI systems


Tools & Frameworks Inside:
  • Python, PyTorch, Hugging Face Transformers

  • LangChain, OpenCV, Whisper, TTS, BLIP

  • ROS, Unity (AR/VR), Gradio, Streamlit

  • Docker, FastAPI, gRPC, TorchServe

Built for engineers. Written with depth. Designed for real-world impact.

If you're ready to build intelligent multi-modal agents that understand the world like humans do - across speech, vision, and language - this book gives you the complete roadmap.

Perfect for:
Machine learning engineers, data scientists, AI product developers, researchers, robotics engineers, and anyone building cutting-edge AI systems.

Actress & Polyglot
EWA KASP for
Play video
Ewa Kasp
Libristo has the largest selection of foreign-language books. That’s why I buy my books there.

About the book

Full name AI Engineering
Author Husn Ara
Language English
Binding Book - Paperback
Date of issue 2025
Number of pages 296
EAN 9798296089038
Libristo code 50705556
Weight 400
Dimensions 152 x 229 x 16
Give this book today
It's easy
1 Add to cart and choose Deliver as present at the checkout 2 We'll send you a voucher 3 The book will arrive at the recipient's address

You might also be interested in


Art and Artificial Intelligence Göran Hermerén / Book Paperback
common.buy 26.22
Make It Profitable! Barbara Brabec / Book Paperback
common.buy 17.44
Knowledge and Identity Gabrielle Ivinson / Book Hardback
common.buy 234.35
Flashpoints Michael Napier / Book Hardback
common.buy 38.33
Isma'ili Modern Jonah Steinberg / E-book Adobe ePub DRM
common.buy 36.21
Hungry Babies Fearne Cotton / E-book Adobe ePub DRM
common.buy 5.64
Self Assessment in Rheumatology Yousaf Ali / E-book Adobe ePub DRM
common.buy 99.16
Cat in Wellies Catherine Laidler / Book Paperback
common.buy 17.65
Conversations: Volume 3 Osvaldo Ferrari / Book Paperback
common.buy 20.47
New
AI Driven Swift Architecture Walid SASSI / Book Paperback
common.buy 43.88

Login

Log in to your account. Don't have a Libristo account? Create one now!

 
mandatory
mandatory

Don’t have an account? Discover the benefits of having a Libristo account!

With a Libristo account, you'll have everything under control.

Create a Libristo account
Book advisor Libroamiko
Hi, I'm Libroamiko, can I help?