Welcome!
Understand the NaN and None difference in Pandas once for all Understand the NaN and None difference in Pandas once for all
Pandas and Numpy are widely used formats for data mining and data sciences, but sometimes people get confused by None an
2022-08-24
Common python errors and how to fix them Common python errors and how to fix them
The following is a list of common errors in Python, and knowing how to fix them will save your day. 1. IndentationErrorI
2022-08-15
Best tips for data scientists, most common option setup in python pandas Best tips for data scientists, most common option setup in python pandas
The most commonly used data table tool for data scientists is pandas; through pandas tables, tables (dataframes) and ana
2022-08-06
A great columnTransformer example to preprocess data for online article SOV prediction A great columnTransformer example to preprocess data for online article SOV prediction
ColumnTransformer is one of the most useful data transforming tools for data science projects. Here we show a complete e
2022-07-16
hyperparameter tuning using Optuna with RandomForestClassifier Example (Python code) hyperparameter tuning using Optuna with RandomForestClassifier Example (Python code)
For some popular machine learning algorithms, how to set the hyper parameters could affect machine learning algorithm pe
2022-07-02
How to insert images in DataBricks notebook How to insert images in DataBricks notebook
Following steps show how to upload images to databricks filestore, and insert that path in the notebook cell. upload the
2022-06-22
How to plot mulitiple subplots of barplots using seaborn How to plot mulitiple subplots of barplots using seaborn
Sometimes we want to plot multiple barplots in subplots, this examples shows a nice way to do it. load sample dataimport
2022-05-15
Good use of regular expression, remove spaces between punctuation and words, normalize texts automatically Good use of regular expression, remove spaces between punctuation and words, normalize texts automatically
Regular expression (shortened as regex) is very powerful way to improve efficiency and change formats for us.Here are ex
2022-03-19
Ten important Python libraries that Data Scientist must know Ten important Python libraries that Data Scientist must know
Good libraries are like useful toolbox, and learning these libraries can make you more productive, whether you’re a novi
2022-03-13
Convert arbitrary date to the date of  Monday or Sundy within the same week in Python Convert arbitrary date to the date of Monday or Sundy within the same week in Python
It is common practice to convert date to the week number for normalization or feature preprocessing. While this is prett
2022-02-22
A transformer example to maintain same feature order and add missing features back for feature engineering A transformer example to maintain same feature order and add missing features back for feature engineering
For data science projects, one important steps in feature engineering is to make sure the order of feature columns durin
2022-01-27
9 / 11