[Challenge] 12 months of data #4

Open
alinapark opened this issue Dec 14, 2022 · 2 comments
Labels: good first issue (Good for newcomers), help wanted (Extra attention is needed)

Comments

alinapark (Contributor) commented Dec 14, 2022

The objective of this challenge is to train a deep learning model that learns the correlation between the time and date of a post, its content, and its location. Time-zone differences, as well as the seasonality of events, should be analyzed and used to predict the location.
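One common way to expose time of day and seasonality to a model is to encode them as points on a circle, so that 23:59 and 00:00, or December 31 and January 1, end up close together. The sketch below is only an illustration of that idea: it assumes each post carries a timestamp, and the function name `cyclical_time_features` and the feature names are hypothetical, not part of the challenge.

```python
import math
from datetime import datetime

def cyclical_time_features(ts: datetime) -> dict:
    """Encode hour of day and day of year as sine/cosine pairs.

    Assumption: `ts` is the post timestamp; the data set's actual
    timestamp field and format are not specified in this issue.
    """
    # Fraction of the day elapsed, in [0, 1).
    hour_frac = (ts.hour + ts.minute / 60) / 24
    # Approximate fraction of the year elapsed (365.25 smooths leap years).
    year_frac = (ts.timetuple().tm_yday - 1) / 365.25
    return {
        "hour_sin": math.sin(2 * math.pi * hour_frac),
        "hour_cos": math.cos(2 * math.pi * hour_frac),
        "year_sin": math.sin(2 * math.pi * year_frac),
        "year_cos": math.cos(2 * math.pi * year_frac),
    }

features = cyclical_time_features(datetime(2022, 12, 14, 18, 30))
print(features)
```

These four values could be concatenated with a text embedding before the regression head; whether that helps on this data set would need to be checked on the development set.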

For example, snow is more likely to be mentioned in the Northern Hemisphere, especially in December. Rock concerts are more likely to happen in the evening and in bigger cities, so the time of a post about a concert can be used to infer the author's time zone and narrow down the list of potential locations.
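The concert example can be sketched as a simple filter: if a post is assumed to have been written during a local evening window, only some UTC offsets are compatible with its UTC timestamp. This is a toy illustration, not the intended model. It assumes post times are available in UTC, the helper name `candidate_utc_offsets` and the 18:00 to 23:00 window are hypothetical, and it only considers whole-hour offsets (some zones, such as Iran at UTC+3:30, use half-hour offsets).

```python
from datetime import datetime, timezone

def candidate_utc_offsets(post_time_utc: datetime, local_window=(18, 23)) -> list:
    """Return whole-hour UTC offsets that place the post inside `local_window`.

    `local_window` is an inclusive (start_hour, end_hour) range in local time.
    Assumption: the event described in the post happened in that window.
    """
    start, end = local_window
    offsets = []
    # Whole-hour offsets in use span UTC-12 to UTC+14.
    for offset in range(-12, 15):
        local_hour = (post_time_utc.hour + offset) % 24
        if start <= local_hour <= end:
            offsets.append(offset)
    return offsets

post = datetime(2022, 12, 14, 17, 30, tzinfo=timezone.utc)
print(candidate_utc_offsets(post))  # [1, 2, 3, 4, 5, 6]
```

A learned model would pick up this kind of constraint softly rather than as a hard rule, but the filter shows why posting time carries information about longitude.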

The provided data set is a .json file that:

  • contains more than 600,000 texts
  • was collected over a span of 12 months
  • covers 15 different time zones
  • covers 6 countries (Cuba, Iran, Russia, North Korea, Syria, Venezuela).

The data set is here

Deliverable

  • A model that takes a text as input and returns coordinates as output
  • Evaluation metrics obtained on the development dataset, including Mean Absolute Error in kilometers.
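The issue does not define how the error in kilometers should be computed. One plausible reading is the mean great-circle distance between predicted and true coordinates, which the sketch below computes with the haversine formula. The function names and the `(latitude, longitude)` tuple layout are assumptions for illustration; the maintainers' actual evaluation script may differ.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius

def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometers between two points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2
    # Clamp to 1.0 to guard against floating-point overshoot before asin.
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(min(1.0, a)))

def mean_absolute_error_km(predicted, actual):
    """Mean haversine distance over paired (lat, lon) tuples."""
    if len(predicted) != len(actual) or not predicted:
        raise ValueError("predicted and actual must be non-empty and of equal length")
    distances = [haversine_km(*p, *a) for p, a in zip(predicted, actual)]
    return sum(distances) / len(distances)

# Toy example: one exact prediction and one on the opposite side of the equator's circle.
preds = [(0.0, 0.0), (0.0, 0.0)]
truth = [(0.0, 0.0), (0.0, 180.0)]
print(round(mean_absolute_error_km(preds, truth), 1))
```

Reporting the median distance alongside the mean is often useful as well, since a few predictions on the wrong continent can dominate the mean.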

We will evaluate the model using a test dataset that is not shared here.

Additional notes

Contact us at [email protected] for any questions or additional requests.

Thank you for contributing to Open Source and making a difference! ʕ•́ᴥ•̀ʔ

alinapark added the "good first issue" and "help wanted" labels on Dec 14, 2022
ingakaspar changed the title from "Challenge 2 - 12 months of data" to "[Challenge] 12 months of data" on Jun 29, 2023
@AnuravModak

Is this challenge still open and currently looking for contributions? @alinapark @ingakaspar

alinapark self-assigned this on Nov 30, 2023

smore88 commented Dec 14, 2023

I see that this challenge is still open. I will take a crack at it and post updates as I go. @alinapark
