lingo.lol: About · Profiles directory · Privacy policy
Mastodon: About · Get the app · Keyboard shortcuts · View source code · v4.4.1
#Lasso #LinearRegression "is useful in some contexts due to its tendency to prefer solutions with fewer non-zero coefficients, effectively reducing the number of features upon which the given solution is dependent"
https://scikit-learn.org/stable/modules/linear_model.html#lasso
@data "practitioners can leverage #LASSO regression to construct more interpretable and predictive models that excel in scenarios involving high-dimensional data and intricate feature relationships."
https://datasciencedecoded.com/posts/12_LASSO_Regression_Feature_Selection_Predictive_Models
How to identify and handle duplicate values: https://stackabuse.com/handling-duplicate-values-in-a-pandas-dataframe/
A categorical variable takes on a limited number of values.
The categorical #dataType is useful in the following cases:
- A string variable consisting of only some values. df[["label"]].astype("category") saves memory.
- The lexical order is not the same as the logical order (“one”, “two”, “three”). Sorting and min/max will use the logical order.
- As a signal to other libraries to treat as a category.
More: https://pandas.pydata.org/pandas-docs/stable/user_guide/categorical.html
"Extract Year from a datetime column", by Piyush Raj: https://datascienceparichay.com/article/pandas-extract-year-from-datetime-column/
Mastodon is the best way to keep up with what's happening.
Follow anyone across the fediverse and see it all in chronological order. No algorithms, ads, or clickbait in sight.
Create accountLogin