-
Notifications
You must be signed in to change notification settings - Fork 48
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Melbourne and Bristol coming up as US only... #16
Comments
Paris comes up as United States, Sydney comes up as Canada.... |
Think of geotext as the general framework on how to extract named entities (low level approach) that are then looked up in an exemplary table of cities. If you want to be able to distinguish between cities in the US, Canada or Australia you could always provide the proper logic in separate lookup tables on your own. |
Thanks @iwpnd iwpnd. I've ended up doing that using geocache So it will come back with a list which has city, country and confidence score. So if you said "I live in London" it would come back with:
London UK gets a higher score because it has a higher population.... That sort of thing. If "Ontario" or "Canada" was in the sentence then that would get a better score. Might upload the code. Thanks for your response, appreciate it. |
I like the idea, thanks for sharing! |
rdlou -- your idea seems great! This is what I ended up doing -- I made a text doc like this: (In case you're wondering about the stopwords I removed, they're words like Franklin |
I was having the same problem. My simple solution was to sort the |
Hi, I am running single cities through the country_mentions func and both of them are coming up only with "OrderedDict([('US', 1)])"
I understand that these are places in the US, but obviously Melbourne is pretty significant in Australia, as is Bristol in the UK. Should the Dict come back with numerous country mentions?
Thanks!
The text was updated successfully, but these errors were encountered: