-
Notifications
You must be signed in to change notification settings - Fork 2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Hyperdot - Powerful data analysis and creations platform — RFP #1815
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@cattania thank you for the application. I notice that the deliverables in the milestones are not compliant with our application template. Could you please adjust to include the mandatory ones, especially in milestones 2 and 3? You can take a look at other applications to see examples of how to describe the milestone deliverables.
I understood that the main objective of the application is to have a tool with great performance. Correct me if I understood wrong. As you already have an MVP, did you already run some benchmarks about the query performance and also the size of the storage in comparison with other tools? If yes, could you share these benchmarks with us or give us an idea about the performance of the tool?
Furthermore, I think the RFP mentioned proposes to have a tool that is flexible like Dune to generate dynamic dashboards that can be shared with other people. I found that is possible to create the queries in the scope of this application, but it will be possible to share them with other people to use it?
Thanks for your reply, I adjusted the milestone compatibility. In terms of performance, I note that Dune customizes the data engine for crypto data, but even so, running complex analytics on tens of millions of blocks of data can be very time consuming, as postgres is not suitable for analytics scenarios. hyperdot currently uses postgres, so it is expected that it will have similar benchmark with performance data to Dune. At the same time, we will establish a good index for block crypto data and enable data compression to further optimize performance. In addition, somewhat different is that hyperdot is more focused on scalability, flexibility, and performance of different analysis scenarios, hyperdot use of multiple data engines to solve this problem. |
@cattania thank you for the answer. What is your long-term vision for this project? This intends to be an open-source library or a business, or do you have some other long-term vision for this project? Do you plan to maintain an index with Polkadot/Kusama data for usage? If so, how do you plan to plan to fund the storage cost? How do you plan for this project to be financially sustainable over time? |
What is your long-term vision for this project? This intends to be an open-source library or a business, or do you have some other long-term vision for this project?In this project, first hyperdot will act as a business. In the coming months, we hope that we can provide users with a SQL query Polkadot/Kusama/Substrate chain data platform for data analysis, at the same time allow can share dashboard. This is exactly what hyperdot describes in this RFP. Next, we want to help get hyperdot in front of more people with the traction of the Foundation. Then, we will continue to improve our technical capabilities, hyperdot will implement the Polkadot/Kusama/Substrate more data model, data analysis engine, query interactions (such as Chatgpt). We are also working to expose hyperdot's indexed data and analytics capabilities to outside developers (like thegraph and subquery) via apis and wasm. These efforts will make hyperdot more competitive. Finally, hyperdot's long-term vision is to become the most competitive platform for on-chain crypto data analysis and creations in web3. Do you plan to maintain an index with Polkadot/Kusama data for usage? If so, how do you plan to plan to fund the storage cost?Yes, this is in line with hyperdot's goals. For storage costs, we plan to obtain funding from the following way
How do you plan for this project to be financially sustainable over time?hyperdot is sustainable because
|
@cattania Thanks for the answer. Are you aware of these two projects that recently received grants from us? Could you please provide a brief comparison of your project with them? Also notice the Data Alliance bounty mentioned in the discussions of these PRs. How could your project fit in this bounty program? It is not launched yet, but it is planned to happen. |
Thanks for your reply. I see #1768 and #1716. #1716 is a etl tool that, coincidentally, we used to maintain ethereum-etl, for a brief comparison
#1768 goes one step further on top of Dott-ETL store data into google BigQuery, similar to how hyperdot provides a post-ETL data engine. It also provides a dashboard, for a brief comparison
About fit Polkadot Data Alliance Bounty, I noticed it and we thought hyperdot very fit
Furthermore, we thought seriously about the Polkadot Data Alliance Bounty idea of "providing a comprehensive, accurate, and accessible data warehouse and rewarding users who contribute to data analysis". hyperdot also aims to provide a multi-chain, multi-data engine, unified sql querying data warehouse, with plans to add creations incentives in the future. |
@cattania thank you for the answer. I marked the application as ready for review. The committee will take a look and can make more questions. We will provide a feedback soon. |
Thank you for your reply. We are ready. In addition, we would like to add a comparison between #1716 and hyperdot. Dot-ETL is built on subquery, while hyperdot is built on subxt. hyperdot is more flexible. Not limited by subquery (e.g. data type of index, supported chain limited) |
@KarimJedda, could you have a look? |
Any progress? |
Thanks for your reply, I guessed it might be because of Decoded 🤣 |
@KarimJedda any update? 🙃 |
Is there any progress on the project? We'd love to hear your feedback |
Took me a while to make time because of another conference (EuroPython) last week and urgent data request for this week, my apologies. High levelAs far as I understand, this project has two components: an indexer (hyperdot-node) and a frontend (hyperdot-fronted-end) to build in essence a Dune analytics for Substrate and the Polkadot ecosystem. From what I can see in the frontend in the POC you shared, that it's currently querying in the browser a Postgres database. What would be beneficial here is to see how that ties in with online charting capabilities, like what Colorful Notion is proposing with the Apache Superset project they're integrating: How do you see users interacting and using the system that would be similar (or different) to Dune? You mention it provides also a post-ETL data engine, leveraging subxt to ingest data directly from the chains themselves. A few general questions:
MilestonesMilestone 1 - Backend
Milestone 2 - FrontendI'm not a frontend expert but I think this looks ok. I'd focus though on very fast iteration and making the frontend agnostic of data sources (ie designing the API the frontend uses in a way that it's not tightly coupled to the storage layer or DB). Meaning, it should work with DuckDB, Postgres, BigQuery etc. Similar to what Apache Superset or Metabase is providing but prettier and more tailored to a "Dune" use case:
That way hyperdot can be used for investigations and people sharing their results, instead of sharing screenshots of dashboards. Just my 2 cents. ConclusionIn general, I would focus on the frontend a whole lot more, it's the main differentiator and user facing capability and I would believe a massive positive point for the community. For the backend, I'd advise considering collaborating with Substrate-ETL (for data directly from chains) or with Dot-ETL together to integrate things like real-time capabilities on top of their solutions. But this is your project, you decide. Believe me when I say it will simplify the infrastructure a lot. Down the line, if you see the need of building a custom backend or indexer for your own tailored use case (as value prop), it would be easy to switch. There's massive heavy lifting and operations involved with the backend part which I believe are under estimated here (don't take it as criticism please) Would you consider building your solution on top of a hybrid of what Substrate-ETL/DOT-ETL is providing and focusing on the frontend / user facing part? Could you also please list the number of chains to be integrated if you end up deciding to do it yourself rather than leverage on substrate-etl/dot-etl? This will help validate the timeline a bit, since it's very ambitious. My recommendation
I'm aware this might go a bit beyond the RFP, but there is some possible synergy with other ecosystem teams here that we should definitely leverage. As software developers, I know it's hard but it'll make things much easier. Going forward I would also recommend:
This is my feedback so far. Hope it helps. |
@KarimJedda thank you for the review. @cattania could you give us feedback about the comments from Karim? Are you willing to incorporate his suggestions in the application? What is your opinion on that? |
Thank you very much for @KarimJedda review. We have seriously thought over Karim suggestions, and there are several main problems at present
Therefore, we adopt part of @KarimJedda suggestion:
Finally, if adopted, We feel hyperdot development costs will come down, and we will change the amount of money we need to fund |
@cattania thank you for the answer. I think it is a reasonable approach. Could you please incorporate these changes in the application document? After that, I think we will be ready for the review of the committee. |
Thanks for your reply, we are in the process of changing the proposal |
@cattania how is it going the changes in the proposal? Any forecast for delivering it? |
yes, we will modify the proposal this week according to previous discussions, focusing more on the integration of UI, substrate-etl and dot-etl |
We have adapted the contents of the proposal in accordance with the previous discussion, including
|
@cattania thank you for the changes. I have some doubts about the application document. I notice that there are three main components, substrate-etl, hyperdot-frontend and hyperdot-node. However, It is not clear to me the relation of these components. Where the data will be stored? How the user will connect with a dashboard? They will need to run a node to be able to see a dashboard? How the data extraction will work? Furthermore, I expected to see some prototypes of the tool. It will enable to use charts in the dashboards or only SQL queries? |
@dsm-w3f Thanks for your reply, the relationship is clear
It allows both chart and SQL queries on the dashboard |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@cattania thank you for the answer. I think it is a good approach. Would be nice to have more prototypes to see the screens of the application but as you are confirming that will be possible to query and generate chats as well as share them, it looks good to me. Happy to go forward with the project. I'll also ping the other members of the committee to reevaluate this application.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for the application. Could @bytesleak also sign the terms and conditions?
Thank you for your reply, @dsm-w3f We are sure that the query chat can be shared with others, such as generating pictures or other better ways. We are still working on the design prototype. Of course, I will immediately communicate with @bytesleak for signing. Thank you again for your reply |
Congratulations and welcome to the Web3 Foundation Grants Program! Please refer to our Milestone Delivery repository for instructions on how to submit milestones and invoices, our FAQ for frequently asked questions and the support section of our README for more ways to find answers to your questions. |
) * add hyperdot rfc * fix: milestone compatibility and delivery shared dashboards * update: changed somethings in the rfc --------- Co-authored-by: alloctor <[email protected]>
) * add hyperdot rfc * fix: milestone compatibility and delivery shared dashboards * update: changed somethings in the rfc --------- Co-authored-by: alloctor <[email protected]>
Project Abstract
Grant level
Application Checklist
project_name.md
).@tania.infra3:matrix.org
(change the homeserver if you use a different one)