High memory and processing requirements prevent deployment on resource-constrained hardware. Reducing these demands allows complex models to function in real-time environments.