Edge and Cloud-Based AI Inference for Big Data Processing

This chapter explores the strategic implementation of AI inference in the edge-cloud continuum, focusing on the downsides of cloud-based processing and the latency, bandwidth, and scalability advantages of edge-based processing. It describes foundational architectures—edge, fog, and hybrid systems—and assesses the appropriateness of each for real-time big-data settings. It also highlights open issues—data volume, variability in quality, and resource constrains—while also discussing more advanced approaches, including model partitioning or distillation, adaptive inference, compression approaches, and federated learning. The chapter includes case studies with real examples of actual improvements in response times, accuracy, and operational overhead. The chapter concludes with technical challenges and considers future research directions on scalable, privacy-preserving distributed AI systems.

MoreLess

Year of publication:	2026
Authors:	Elmobark, Nagwa
Published in:	Harnessing AI Inference for Intelligent Decision-Making in Real-Time Dataflows. - IGI Global Scientific Publishing, ISBN 9798337370484. - 2026, p. 87-122

More details

Type of publication:	Article
Type of publication (narrower categories):	chapter
Language:	English
Other identifiers:	10.4018/979-8-3373-7046-0.ch004 [DOI]
Source:	Other ZBW resources

Persistent link: https://www.econbiz.de/10015649467