Data science

Data science is a multi-disciplinary field that uses scientific methods, processes, algorithms and systems to extract knowledgeand insights from structured and unstructured data.[1][2] Data science is the same concept as data mining and big data: "use the most powerful hardware, the most powerful programming systems, and the most efficient algorithms to solve problems".[3]

Data science is a "concept to unify statisticsdata analysismachine learning and their related methods" in order to "understand and analyze actual phenomena" with data.[4] It employs techniques and theories drawn from many fields within the context of mathematicsstatisticscomputer science, and information scienceTuring award winner Jim Grayimagined data science as a "fourth paradigm" of science (empiricaltheoreticalcomputational and now data-driven) and asserted that "everything about science is changing because of the impact of information technology" and the data deluge.[5][6] In 2015, the American Statistical Associationidentified database management, statistics and machine learning, and distributed and parallel systems as the three emerging foundational professional communities.[7]

Posted on by