Apache Zeppelin
Apache Zeppelin is an open-source web-based notebook for interactive data analytics, supporting multiple programming languages. The latest version, 0.11.1, is built with Java 11 and offers JDBC connections and collaborative tools.
Apache Zeppelin is a web-based notebook designed for interactive data analytics and collaborative documentation, supporting multiple programming languages including SQL, Scala, Python, and R. The latest version, 0.11.1, is built with Java 11 and supports the latest features of Apache Spark and Apache Flink. Zeppelin allows seamless connections to various JDBC data sources such as PostgreSQL, MySQL, and Apache Hive. It features built-in visualizations, dynamic forms, and multi-user support with LDAP for collaborative work. Users can easily create charts and share notebooks in real-time, similar to Google Docs. The platform is open-source and encourages community contributions, with a focus on data ingestion, discovery, analytics, and visualization.
- Apache Zeppelin supports multiple programming languages and interpreters.
- The latest version is built with Java 11 and supports Apache Spark and Flink.
- It allows seamless connections to various JDBC data sources.
- Users can create dynamic forms and visualizations easily.
- Zeppelin is open-source and promotes community involvement.
Related
Show HN: Adding Mistral Codestral and GPT-4o to Jupyter Notebooks
Pretzel is an open-source tool enhancing Jupyter with AI code generation, inline tab completion, sidebar chat, and error fixing. Seamless transition from Jupyter is possible, maintaining compatibility. Installation via 'pip install pretzelai'.
Gravitino: A Powerful Open Data Catalog for Geo-Distributed Metadata Lakes
Apache Gravitino is an incubating project by The Apache Software Foundation, offering a geo-distributed metadata lake for data and AI assets, featuring centralized security and unified management, built with Gradle.
Postgres.new: In-browser Postgres with an AI interface
postgres.new is an in-browser Postgres sandbox that integrates AI assistance for managing databases, supporting features like CSV imports, report generation, and semantic search, with future cost-effective deployments planned.
Zed AI
Zed AI is a new tool enhancing coding productivity with LLM integration, featuring an assistant panel for AI interactions and inline transformations for real-time editing, currently free during launch.
Grafana 11.2 release: new updates for data sources, visualizations, and more
Grafana 11.2 enhances features with over 100 data sources, including Yugabyte and Zendesk, improves dashboards with standardized tooltips and pagination, and upgrades transformations for dynamic data manipulation.
- Users appreciate Zeppelin's interactive features compared to Jupyter, but note it lacks traction and community support.
- Some suggest alternative notebooks like Almond and Polynote for Scala and Spark support.
- There are concerns about Zeppelin's declining usage, with many preferring Jupyter or Databricks for their convenience and popularity.
- Users reminisce about their past experiences with Zeppelin and its integration with Spark.
- Overall, while Zeppelin has unique features, its adoption and development have stalled compared to other tools.
Didn't work out all that well for a number of reasons.
The most important thing is, users are used to Jupyter. Zeppelin's UI is very different, and most people are not willing to jump on yet another learning adventure just for the sake of it.
Then, it's not as widely adopted and supported as JupyterHub. With JupyterHub you can easily integrate whatever you want to. Want several simultaneous Jupyters for each user? Sure. Want separate quotas, different k8s namespaces for user groups? Easy. A shitton of plugins? Here you go. A selection of different images for each user, depending on the tooling required? Welcome.
Third thing is really unfortunate, but Zeppelin proved to have less-than-stellar stability and performance, at least in my experience. People are wary of something that's often unreliable.
So I've finally decided to just go with JupyterHub, and users can't be happier. Everything's fully customized, things are smooth and familiar to a non-dev crowd.
Another, and in some ways better, solution would be to go with VS Code, but I doubt a typical analyst or data scientist would prefer VS Code, at least for now.
All in all, I don't see a place for Zeppelin: it can't compete with what's already on the market, and yet it doesn't bring anything new and worthwhile.
Toree is mostly dead but might also get a Scala 2.13 release now that Spark 4.0 is approaching.
P.S. I was a committer there until I changed jobs.
Not all of them get that much love, but often they have pretty nice functionality.
I still remember that setting up Apache Skywalking was one of the easier ways of getting some APM and tracing in place, compared to the other options out there.
And, of course, the likes of Apache2 and Apache Tomcat are also quite useful in some circumstances.
Zeppelin does make it easier to run Scala Spark, I find, but Scala Spark usage has declined rapidly.