Remove `patch_method_to_DataFrame` and use piping for functions in `computation.operations` #166

leewujung · 2024-01-25T08:13:17Z

See #164 (review) -- to keep code easily traceable seems better to just use .pipe and not a specialized decorator for these operations.

This is low priority since all functions are working fine now. We can do this after v0.4.2 is released.

The text was updated successfully, but these errors were encountered:

brandynlucca · 2024-01-25T22:26:25Z

Just to provide additional context so I can reference this later to discern the logic/rationale for why I implemented these functions using a sort of monkey patch rather than just using the native pandas.DataFrame.pipe method.

There is certainly overlap between patch_method_to_DataFrame decorator and the associated functions (that will be renamed in via #164, but like discretize_variable) and the flexibility/extension of pandas.DataFrame that .pipe provides. My brain definitely prefers moving through pipes with chained method, e.g. object.function(*args), rather than as independent functions/mutations, e.g. function(object, *args). As you've pointed out, .pipe accomplishes the same task (it's almost as if pipe was intentionally designed that way or something...). So the (kind of) monkey patch feature in patch_method_to_DataFrame is (almost) entirely an aesthetic choice such that the difference for how discretize_variable is written out would be:

# the current (sort of) monkey patched pandas.DataFrame extension approach
specimen_grouped = (
    specimen_df_copy
    .assign( arbitrary_value = 1 )
    .discretize_variable( bin_values = length_intervals , bin_variable = 'length' )
    .assign( group = lambda x: np.where( x['sex'] == int(1) , 'male' , 'female' ) )
    .pipe( lambda df: pd.concat( [ df.loc[ df[ 'sex' ] != 3 ] , df.assign( group = 'all' ) ] ) )
    .assign( station = 2 )
)

# the 'let's not be beholden to Brandyn's mostly trivial and idiosyncratic code aesthetic preferences" approach
specimen_grouped = (
    specimen_df_copy
    .assign( arbitrary_value = 1 )
    .pipe( lambda df: discretize_variable( df , bin_values = length_intervals , bin_variable = 'length' ) )
    .assign( group = lambda x: np.where( x['sex'] == int(1) , 'male' , 'female' ) )
    .pipe( lambda df: pd.concat( [ df.loc[ df[ 'sex' ] != 3 ] , df.assign( group = 'all' ) ] ) )
    .assign( station = 2 )
)

leewujung added the low_priority label Jan 25, 2024

leewujung mentioned this issue Jan 25, 2024

Refactor compute_transect_results #164

Merged

brandynlucca mentioned this issue Mar 28, 2024

Update kriging apportionment method and functions #215

Merged

leewujung added this to Echopop Mar 31, 2024

leewujung added this to the v0.4.1 (docs and data structure update) milestone Mar 31, 2024

leewujung modified the milestones: v0.4.1 (docs, LS fitting), v0.5.0 (refactor bootstrapping) Sep 23, 2024

brandynlucca modified the milestones: bootstrapping, Refactoring and generalization Apr 16, 2025

brandynlucca added the refactor label Apr 16, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Remove `patch_method_to_DataFrame` and use piping for functions in `computation.operations` #166

Remove `patch_method_to_DataFrame` and use piping for functions in `computation.operations` #166

Uh oh!

Remove patch_method_to_DataFrame and use piping for functions in computation.operations #166

Remove patch_method_to_DataFrame and use piping for functions in computation.operations #166

Comments

Uh oh!

Remove `patch_method_to_DataFrame` and use piping for functions in `computation.operations` #166

Remove `patch_method_to_DataFrame` and use piping for functions in `computation.operations` #166