KNIME tutorial: Random forest machine learning model to predict Kaggle Titanic (part 2)

KNIME tutorial: Random forest machine learning model to predict Kaggle Titanic (part 2)

  Random Forest Models The random forest model is easy to execute in KNIME.  It is a popular model because it is easy to implement, adaptable, and robust to overfitting.  Random forests are a common way for new people to get started with machine learning.   How do they work?   The random forest worksRead more about KNIME tutorial: Random forest machine learning model to predict Kaggle Titanic (part 2)[…]

KNIME tutorial: Kaggle Titanic machine learning problem data prep and cleaning (part 1)

KNIME tutorial: Kaggle Titanic machine learning problem data prep and cleaning (part 1)

  KNIME Machine Learning Tutorial I  love to help people who are climbing the career ladder, looking to make a switch, or established in their fields, to learn to use more analytics and data science in their work.  One of the things that has fascinated me for years is how people say they want toRead more about KNIME tutorial: Kaggle Titanic machine learning problem data prep and cleaning (part 1)[…]

Statistics Lie (part 5): Sampling on the dependent variable…or why waking up at 4 am won’t make you successful

Statistics Lie (part 5): Sampling on the dependent variable…or why waking up at 4 am won’t make you successful

  Sampling on the dependent variable is something you see all the time if you read clickbait articles like the crap in Business Insider.  These articles typically start with something like, “things all successful people do…” and then make claims about waking up early, or drinking 3 cups of coffee, etc.   If you areRead more about Statistics Lie (part 5): Sampling on the dependent variable…or why waking up at 4 am won’t make you successful[…]