
Best open-source integration with Spark
![H2O.ai screenshot - 10 Best Machine Learning Software [2022 List]](https://static.crozdesk.com/web_app_library/categories_providers/screenshots/000/061/095/pub/categories-providers-screenshot-1658862281.png?1658862281)
H2O.ai is a user-friendly, accessible AI platform that Gartner named a Visionary in its 2020 Magic Quadrant for Data Science and Machine Learning Platforms. Its supported use cases include fraud prevention, anomaly detection, and price optimization. H2O Sparkling Water integrates H2O with Spark: users can run a query with Spark SQL, feed the results into H2O to build a model and generate predictions, and then pass those predictions back to Spark for further processing.
H2O.ai costs from $0.046/hour and offers a free 21-day trial.
Pros:
Big data support with H2O’s Sparkling Water
Flexible modeling, including ensembles
Flexible horizontal scaling through dynamically provisioned clusters
Excellent commitment to open-source transparency
Cons:
More cutting-edge algorithms would be welcome
Some documentation could be refined
Charting and visuals could use improvement