Breaking Up With Flask & FastAPI: Why ML Model Serving Requi...

One of the first things Data Scientists learn as they run predictions is to avoid the use of loops. That’s because most ML libraries support vectorized inference, combining many inputs into a batch and more efficiently calculating the results. This specialized technique combines framework-level features with specialized hardware like GPUs, making p... See more

Breaking Up With Flask & FastAPI: Why ML Model Serving Requires A Specialized Framework

Tim Liu • Breaking Up With Flask & FastAPI: Why ML Model Serving Requires A Specialized Framework