Skip to content

Label geographic coordinate to district coordinate  #4

@juifa-tsai

Description

@juifa-tsai

According to pre-EDA, the pickup & dropoff locations can be the significant variables for analysis. Those variables are stored with geographic coordinates, i.e. longitude v.s. latitude. However, those values may be non-meaningful and bias in regression method,, since they are for labeling the location mathematically instead for quantity. Thus, finding the meaningful & efficient way to label the location is an issue. Here is an basic idea - Categorized to belonged districts instead of using continue value :

  1. Category types/ranges : Boroughs (5) > Community areas/boards, CB (18max/bor.) > Neighborhoods (?)
  2. Label with encode method to binary bits, e.g 010 110 for one of CB (total 59).

Discussion :

  1. Should we use multi-categories & single category?
  2. Use CB or Neighborhoods?
  3. How to extract neighborhoods numbers?
  4. How to map geographic coord. to particular district category?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions