feat: add endpoint for participants with highest measurement count #164

Merged

Conversation


@PatrickNercessian PatrickNercessian commented Jun 24, 2024

@PatrickNercessian
Collaborator Author

Have not added tests yet

This would go hand-in-hand with a change in spark-evaluate to add this index:

CREATE INDEX stations_day_accepted_count ON daily_stations (day, accepted_count DESC);

What do we think about this approach? Because we're using an aggregate function, the index will not be perfectly helpful. The query will still need to run the full summation and at least partial sorting, I think.

Alternatively, we could create a new table that just updates the total sum for each station_id. However, I lean towards thinking that would be overkill, considering we already cache queries for 24h, so we are rarely running this query.
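The shape of the query under discussion can be sketched against an in-memory SQLite database (a stand-in for the real Postgres setup; the `station_id` column name and the exact query are assumptions for illustration, not the PR's actual code):

```python
import sqlite3

# In-memory stand-in for the spark-evaluate Postgres database.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE daily_stations (day TEXT, station_id TEXT, accepted_count INTEGER)"
)
# The index proposed in the comment above.
conn.execute(
    "CREATE INDEX stations_day_accepted_count"
    " ON daily_stations (day, accepted_count DESC)"
)
conn.executemany(
    "INSERT INTO daily_stations VALUES (?, ?, ?)",
    [
        ("2024-07-01", "a", 10),
        ("2024-07-01", "b", 5),
        ("2024-07-02", "a", 7),
        ("2024-07-02", "b", 20),
    ],
)

# The aggregate still has to sum every matching row and sort the totals;
# the index can only narrow the scan to the requested date range.
top = conn.execute(
    """
    SELECT station_id, SUM(accepted_count) AS total
    FROM daily_stations
    WHERE day BETWEEN '2024-07-01' AND '2024-07-02'
    GROUP BY station_id
    ORDER BY total DESC
    LIMIT 10
    """
).fetchall()
print(top)  # [('b', 25), ('a', 17)]
```

This illustrates the point made above: the `(day, accepted_count DESC)` index helps the `WHERE` filter, but the `GROUP BY`/`SUM`/`ORDER BY` still have to process every row in the range.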

stats/lib/platform-stats-fetchers.js (outdated, resolved)
stats/lib/platform-routes.js (outdated, resolved)
stats/lib/platform-stats-fetchers.js (outdated, resolved)
stats/lib/platform-stats-fetchers.js (outdated, resolved)
@PatrickNercessian
Collaborator Author

Depends on filecoin-station/spark-evaluate#274

@bajtos bajtos (Member) left a comment


The new version looks pretty good.

I'd like to run the SQL query once we have a full day's worth of data, to see how expensive it is.

Preliminary plan:

  • Land the spark-evaluate PR tomorrow and start collecting the data
  • Wait until Monday
  • On Monday, test the SQL query to see how it works in practice

stats/lib/platform-stats-fetchers.js (outdated, resolved)
stats/lib/platform-stats-fetchers.js (outdated, resolved)
stats/test/platform-routes.test.js (outdated, resolved)
stats/test/platform-routes.test.js (outdated, resolved)
@PatrickNercessian PatrickNercessian marked this pull request as draft July 1, 2024 14:56
stats/bin/spark-stats.js (outdated, resolved)
@bajtos bajtos changed the title feat: add endpoint for stations with highest measurement count feat: add endpoint for participants with highest measurement count Jul 3, 2024
@bajtos bajtos (Member) left a comment


You are going in the right direction 👍🏻

stats/lib/platform-stats-fetchers.js (resolved)
Comment on lines 178 to 181:

  // We don't care about the date range for this query
  const res = await fetch(
    new URL(
      '/participants/top-measurements?from=2024-01-11&to=2024-01-11',
A Member replied:

> We don't care about the date range for this query

As I am arguing in my other comment above, we need to care about the date range.

@PatrickNercessian PatrickNercessian marked this pull request as ready for review July 3, 2024 15:20
stats/lib/platform-routes.js (outdated, resolved)
stats/lib/platform-stats-fetchers.js (resolved)
stats/lib/platform-stats-fetchers.js (resolved)
@bajtos bajtos (Member) left a comment


I realised that since we don't use the filter at all, maybe this endpoint shouldn't be implemented via respond(pgPools.evaluate, fetchParticipantsWithTopMeasurements).

Instead, we could write a custom request handler and configure a caching strategy that makes more sense in this particular case - e.g., we can set TTL until midnight UTC.

Anyhow, it's a relatively minor thing, I am fine to land this pull request as it is now.

:shipit:
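The "TTL until midnight UTC" idea could be computed roughly as follows (a sketch only; `seconds_until_midnight_utc` is a hypothetical helper, not part of spark-stats, which caches a day's results until the next day of `daily_stations` data exists):

```python
from datetime import datetime, timedelta, timezone

def seconds_until_midnight_utc(now=None):
    """Cache TTL, in seconds, expiring at the next UTC midnight."""
    now = now or datetime.now(timezone.utc)
    next_day = (now + timedelta(days=1)).date()
    midnight = datetime.combine(next_day, datetime.min.time(), tzinfo=timezone.utc)
    return int((midnight - now).total_seconds())

# At 23:00 UTC the cached entry would live for one more hour.
ttl = seconds_until_midnight_utc(datetime(2024, 7, 4, 23, 0, tzinfo=timezone.utc))
print(ttl)  # 3600
```

A custom request handler could set this value as the `max-age` of its caching headers instead of the fixed 24h TTL used by the generic `respond()` path.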

@bajtos bajtos (Member) commented Jul 4, 2024

The CI build is failing in the dry-run step because this PR does not have access to the GLIF_TOKEN secret.

Error: server response 401 Unauthorized (
  request={ },
  response={ },
  error=null,
  info={ "requestUrl": "https://api.node.glif.io/rpc/v0", "responseBody": "<html>\r\n<head><title>401 Authorization Required</title></head>\r\n<body>\r\n<center><h1>401 Authorization Required</h1></center>\r\n</body>\r\n</html>\r\n", "responseStatus": "401 Unauthorized" },
  code=SERVER_ERROR,
  version=6.13.1
)

@bajtos bajtos merged commit 0b56565 into filecoin-station:main Jul 4, 2024
9 of 10 checks passed
3 participants