Skip links

  • Skip to primary navigation
  • Skip to content
  • Skip to footer
An independent mind...
  • Portfolio
  • Posts
  • Categories
  • Tags
  • About
    Francis T. O'Donovan

    Francis T. O'Donovan

    Data Science Manager at Hospital IQ – Planet discoverer, researcher, developer, geek.

    • Boston, MA
    • Website
    • Email
    • Bitbucket
    • GitHub
    • KeyBase
    • LinkedIn
    • StackOverflow
    • Twitter
    • Email

    Data problems

    less than 1 minute read

    Another nice Medium post from Benjamin Obi Tayo has a good summary of the types of issues you should always be mindful of when you get a new data set:

    1. Wrong Data
    2. Missing Data
    3. Outliers in Data
    4. Redundancy in Data
    5. Unbalanced Data
    6. Lack of Variability in Data
    7. Loss of Data
    8. Dynamic Data
    9. Size of Data

    Tags: data, errors, machine learning, ml, random

    Categories: tips

    Updated: September 11, 2020

    Twitter Facebook LinkedIn
    Previous Next

    Comments

    You May Also Enjoy

    (TIL) Mac: Ask user for password via GUI

    less than 1 minute read

    This function will use AppleScript to present a password entry dialog to make your scripts a little more user friendly:

    (TIL) Mac: Date and Time

    10 minute read

    List Available Timezones:

    (TIL) Mac: Restart or shutdown

    less than 1 minute read

    You can restart or shutdown from the command line:

    (TIL) Nix: Stty - sane terminal settings

    less than 1 minute read

    Restore sane shell settings, in case your shell session went insane because some script or application turned it into a garbled mess:

    • Website
    • Email
    • Bitbucket
    • GitHub
    • KeyBase
    • LinkedIn
    • StackOverflow
    • Twitter
    • Feed
    © 2021 Francis T. O'Donovan. Powered by Jekyll & Minimal Mistakes.