Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Task 2b: Work Locations (VALIDATION) #33

Open
Hussein-Mahfouz opened this issue Jul 12, 2024 · 0 comments
Open

Task 2b: Work Locations (VALIDATION) #33

Hussein-Mahfouz opened this issue Jul 12, 2024 · 0 comments
Assignees
Labels
Task 2 assigning activities to geographic locations validation Model validation and consistency

Comments

@Hussein-Mahfouz
Copy link
Collaborator

Hussein-Mahfouz commented Jul 12, 2024

I'm currently using two approaches for assigning work locations (see work.py)

1 - select_work_zone_iterative() An iterative approach (iterating over individual activities and filling in a matrix that has constraints in each cell)

2 - select_work_zone_optimization(). Formulated as a linear optimization problem. Objective is minimize deviation of values from reference data. Two forms of deviation: (a) total deviation (sum of all deviations), and (b) maximum deviation among all ODs

Validation metrics

How do we test how well each approach performs? Our reference data (which is being used in both approaches) is the census commuting matrix

Some suggestions from this paper: A neural network approach for population synthesis

Error Based measures (compare degree of difference) 

  • Total Absolute Error
  • Root Mean Square Error (RMSE)
  • Standardized RMSE
  • Mean Absolute Error (MAE) 
  • Population Error Rate

Similarity Measures (compare degree of similarity)

  • Jaccard similarity coefficient
  • Intersection rate
  • Difference rate

I'm not sure how to use the similarity measures, as they are used for binary data. Should the values be converted to binary values based on threshold values (i.e. the predicted flow for an OD is considered the same as the actual flow if they are within x% of each other)?

Other approaches

There are probably better ways to approach this problem. Here are a couple of repos I've come across that use spatial interaction models to constrain a discrete OD matrix.

@Hussein-Mahfouz Hussein-Mahfouz added Task 2 assigning activities to geographic locations validation Model validation and consistency labels Jul 12, 2024
@Hussein-Mahfouz Hussein-Mahfouz self-assigned this Jul 12, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Task 2 assigning activities to geographic locations validation Model validation and consistency
Projects
None yet
Development

No branches or pull requests

1 participant