
[WIP] shared perimeter-weighted contiguity #507


Closed · wants to merge 11 commits

Conversation

knaaptime
Member

@knaaptime knaaptime commented Jan 12, 2023

Supersedes #506; resolves #80

@knaaptime knaaptime requested a review from sjsrey January 12, 2023 00:10
@knaaptime knaaptime changed the title from "rebase" onto latest to shared perimeter-weighted contiguity Jan 12, 2023
@knaaptime
Member Author

knaaptime commented Jan 12, 2023

@sjsrey the last commit implements the logic we talked about today. I can add a test case, or we could have you sanity-check again?

also not too sure what that new kwarg should be called

knaaptime and others added 3 commits January 12, 2023 08:26
Co-authored-by: Martin Fleischmann <martin@martinfleischmann.net>
Co-authored-by: Martin Fleischmann <martin@martinfleischmann.net>
Co-authored-by: Martin Fleischmann <martin@martinfleischmann.net>
@knaaptime knaaptime changed the title from shared perimeter-weighted contiguity to [WIP] shared perimeter-weighted contiguity Jan 13, 2023
@knaaptime
Member Author

I've got a notebook to go with this, but I'm running into issues when I create a perimeter-weighted W with use_index=True after setting a meaningful index. I think the issue has to do with how ids are handled in to_adjlist, but I'm still investigating.

cc: @martinfleis @ljwolf

@knaaptime
Member Author

knaaptime commented Jan 13, 2023

import geopandas as gpd
from libpysal import examples
from libpysal.weights import Rook

us = gpd.read_file(examples.get_path("us48.shp"))

old = Rook.from_dataframe(us)
new = Rook.from_dataframe(us.set_index('STATE_FIPS'), use_index=True)

(screenshots: the adjacency tables produced by old and new)

I think the neighbor relationships here are correct, but the indices get screwed up (which causes issues for the perimeter weights, since they rely on a join)

@martinfleis
Member

Did I mess up #477?

@knaaptime
Member Author

Can't say for certain, but I think all of that is OK. We may just need to update the to_adjlist method.

@knaaptime
Member Author

I'm not sure whether it's the to_adjlist method that's to blame, actually. I can get those tables above to match if I reset the index prior to return in the current to_adjlist method, but the problem persists.

This notebook runs fine as-is. But if I explicitly set the index to something like STATE_FIPS in cell 25 and then use use_index=True with the perimeter weights, the code will run without error but produce incorrect results. Still not sure why.

@sjsrey
Member

sjsrey commented Jan 16, 2023

It could be due to to_adjlist relying on id_order

names = np.asarray(self.id_order)

which we have not yet deprecated.

@sjsrey
Member

sjsrey commented Jan 16, 2023

Maybe changing:

    focal_ix, neighbor_ix = self.sparse.nonzero()
    names = np.asarray(self.id_order)
    focal = names[focal_ix]
    neighbor = names[neighbor_ix]
    weights = self.sparse.data
    adjlist = pandas.DataFrame(
        {focal_col: focal, neighbor_col: neighbor, weight_col: weights}
    )

to:

    focal_ix, neighbor_ix = self.sparse.nonzero()
    weights = self.sparse.data
    adjlist = pandas.DataFrame(
        {focal_col: focal_ix, neighbor_col: neighbor_ix, weight_col: weights}
    )

?

@sjsrey
Member

sjsrey commented Jan 16, 2023

I think the to_adjlist issue is sorted with #510:

(screenshot: to_adjlist output after #510)

@knaaptime
Member Author

That gets the alignment back in shape so the weights are correct, but we lose the index itself (in the updated example 'focal' is an integer sequence, not FIPS codes).

@sjsrey
Member

sjsrey commented Jan 16, 2023

This is because to_adjlist is grabbing the indices from the w.sparse attribute, which only has integer indices.

We also don't have a way back to the df from new once new = Rook.from_dataframe(us.set_index('STATE_FIPS'), use_index=True) is created. We are going to deprecate id_order which currently has the values, so we can't/shouldn't use that.

We could get the STATE_FIPS values out from calling new.full() but that has a smell to it as it defeats the purpose of using the sparse attribute inside of to_adjlist.

In to_adjlist we could jettison focal_ix, neighbor_ix = self.sparse.nonzero()

and instead do something like:

focal_ix = []
neighbor_ix = []
for key, value in new.neighbors.items():
    focal_ix.extend([key] * len(value))
    neighbor_ix.extend(value)

But I think we should get @ljwolf's view on these issues.
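As a quick sanity check (a toy sketch, not the library code), the loop suggested above can be exercised on a small label-keyed neighbors dict, a stand-in for w.neighbors, to confirm the original ids survive into the adjacency list:

```python
# Toy sketch: build the focal/neighbor columns directly from a label-keyed
# neighbors dict instead of the integer positions from sparse.nonzero(),
# so the original ids (here, fake FIPS-like strings) are preserved.
import pandas as pd

neighbors = {"53": ["16", "41"], "16": ["53"], "41": ["53"]}

focal_ix = []
neighbor_ix = []
for key, value in neighbors.items():
    focal_ix.extend([key] * len(value))
    neighbor_ix.extend(value)

adjlist = pd.DataFrame({"focal": focal_ix, "neighbor": neighbor_ix})
print(adjlist)
```

Since Python dicts preserve insertion order, the rows come out grouped by focal unit, matching the ordering of w.neighbors.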

@sjsrey sjsrey requested a review from ljwolf January 17, 2023 00:47
@knaaptime
Member Author

got it

so in the new API, we take the ids kwarg and deprecate idvariable and id_order. But then we never expose W.ids publicly. I'm not sure whether that was intentional, but it means when we go back and forth from sparse, we have trouble redoing the logic, e.g. here. We could use w.neighbors.keys() in place of id_order there, since we know the dict is sorted (but in that case, are we keeping id_order_set or some other indicator that we could test against? otherwise we need to update the dict blindly even if unnecessary)

@sjsrey
Member

sjsrey commented Jan 17, 2023

(quoting @knaaptime's comment above)

This was a good catch, as it raises a number of issues that are related to the planned deprecation.

@knaaptime
Member Author

If we are getting rid of id_order_set, and we've decided that exposing W.ids is redundant, I guess when roundtripping to sparse we could do a simple test with something like list(w.neighbors.keys()) == list(range(len(w.neighbors))) and remap ids if false?
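A minimal sketch of that test-and-remap idea (the function name and shape are hypothetical, not the eventual API):

```python
# Hypothetical sketch of the roundtrip check: if the neighbors dict is keyed
# 0..n-1, the positional ids coming back from sparse are already correct;
# otherwise, remap positions back to the original labels.
def remap_ids(neighbors, positional_pairs):
    keys = list(neighbors.keys())
    if keys == list(range(len(neighbors))):
        return positional_pairs  # already positional, nothing to do
    return [(keys[i], keys[j]) for i, j in positional_pairs]

labeled = {"53": ["16"], "16": ["53", "41"], "41": ["16"]}
print(remap_ids(labeled, [(0, 1), (1, 0), (1, 2), (2, 1)]))
# [('53', '16'), ('16', '53'), ('16', '41'), ('41', '16')]
```

The same call is a no-op when the weights were built without a meaningful index, so positional ids pass through untouched.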

@sjsrey
Member

sjsrey commented Jan 17, 2023

(quoting @knaaptime's example and screenshots above)

to_adjlist sorts before returning. If it did not sort, the returned df would align with what you are expecting:

    focal neighbor  weight
0      53       16     1.0
1      53       41     1.0
2      30       38     1.0
3      30       46     1.0
4      30       56     1.0
..    ...      ...     ...
205    12       01     1.0
206    12       13     1.0
207    26       55     1.0
208    26       18     1.0
209    26       39     1.0

We could add a kw argument to implement this.
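Sketched with a hypothetical kwarg name (not the actual to_adjlist signature), the idea would look something like:

```python
# Hypothetical kwarg sketch: let callers opt out of the final sort so the
# adjacency list keeps the order the pairs were produced in (e.g. by
# sparse.nonzero()), instead of always sorting before returning.
import pandas as pd

def to_adjlist_sketch(pairs, sort_joins=True):
    df = pd.DataFrame(pairs, columns=["focal", "neighbor"])
    if sort_joins:
        df = df.sort_values(["focal", "neighbor"]).reset_index(drop=True)
    return df

pairs = [("53", "16"), ("30", "38"), ("12", "01")]
print(to_adjlist_sketch(pairs, sort_joins=False))  # original order preserved
print(to_adjlist_sketch(pairs))                    # sorted, as today
```

Defaulting the new kwarg to the current (sorting) behavior would keep existing callers unaffected.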

@knaaptime
Member Author

I think we also need to store (and expose publicly) the ids property. We will need that as a mapping any time we roundtrip through a WSP.

@knaaptime
Member Author

I think #511 solves the core issue, so this is probably ready as well.

@martinfleis martinfleis changed the base branch from master to main February 27, 2023 08:45
@ljwolf
Member

ljwolf commented Dec 10, 2023

To make sure, this has already landed in libpysal.graph, right? This should be closed if I'm reading that correctly, imho, since we should probably freeze new features in libpysal.weights.

@knaaptime
Member Author

yup, this is stale

@knaaptime knaaptime closed this Dec 10, 2023
Review comment by @knaaptime on the diff, near:

    # Putting it back to a matrix
    if perimeter_standardize:

@ljwolf do we want to keep the option to standardize when the boundary isn't exhausted? (so the denominator is boundary_i instead of \sum_j shared_boundary_{ij})?
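For context, the two denominators under discussion differ only when a unit's boundary is not fully shared with neighbors. A numeric sketch with made-up lengths (names illustrative, not the PR's actual API):

```python
# Illustrative sketch of the two standardizations: divide each shared-
# boundary length either by the focal unit's total shared boundary (rows
# sum to 1) or by its full perimeter (rows sum to < 1 when the boundary
# is not exhausted, e.g. a coastline). All numbers are made up.
shared = {("a", "b"): 2.0, ("a", "c"): 2.0}   # shared boundary lengths for "a"
perimeter = {"a": 10.0}                       # full perimeter of unit "a"

total_shared = sum(shared.values())           # 4.0 of a's 10.0 perimeter
row_std = {ij: w / total_shared for ij, w in shared.items()}
perim_std = {ij: w / perimeter[ij[0]] for ij, w in shared.items()}

print(row_std)    # weights sum to 1.0
print(perim_std)  # weights sum to 0.4: the unshared 60% of the boundary shows up
```

Under the perimeter denominator, the row sums carry information about how much of each unit's boundary is interior to the study area, which is exactly what the standardization choice trades away or keeps.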


Successfully merging this pull request may close these issues.

ENH: shared perimeter contiguity weighting.
4 participants