Mining Data on the Internet 2020

According to Wikipedia, “data mining is a process of discovering patterns in large data sets involving methods at the intersection of machine learning, statistics, and database systems. Data mining is an interdisciplinary subfield of computer science and statistics with an overall goal to extract information (with intelligent methods) from a data set and transform the information into a comprehensible structure for further use. Data mining is the analysis step of the “knowledge discovery in databases” process, or KDD.”

Data mining is a constantly evolving discipline applied in many fields including finance, law, healthcare, marketing, science and engineering, the retail industry, telecommunications, social media, and government. This guide encompasses free, fee based and consultancy related sources to assist info pros, researchers, data analysts, knowledge managers, and CI/BI experts, to effectively identify and apply reliable, value added data within the scope of their respective work products.

Mining Data on the Internet 2020

45 Great Resources for Learning Data Mining Concepts and Techniques 

50 Data Mining Resources – Tutorials, Techniques and More

80legs – Custom Web Crawlers for Crawling and Processing Web Content

2020 Directory of Directories

2020 Guide to Finding Experts By Using the Internet

2020 Guide to Privacy Resources and Tools

2020 Guide to Searching the Internet

2020 New Economy Resources

Advanced Analytics – Unstructured Data Mining

An Evaluation of Data Mining Methods and Tools

An Overview of Data Mining in Road Traffic and Accident Analysis

ACM SIGKDD: Current Explorations Issue

Analytics, Data mining and Data Science

Apache Pig – Platform for Analyzing Large Datasets

Applications of Modern Heuristics and Data Mining Techniques

ARTstor – Digital Image Library for Education and Scholarship

Benchmarking- Data Mining Benchmarking Association

Best Data Mining Tools – Reviews, Pricing and Demos

BI-DW – Business Intelligence and Data Warehousing Directory

Big Data Analytics with Oracle Advanced Analytics

Big Oil Goes Mining for Big Data

Bot Research 2020

Business Intelligence Resources 2020

Calculating Costs of a Data Mining System

CCSU – Data Mining

Center for Automated Learning and Discovery – Machine Learning Department

Cogitum Co-Citer

Contentmine – Text and Data Mining Open Source Tools

Copyright Clearance Center

COREMINE Medical – Biomedical Mindmap

Current Awareness Discovery Tools on the Internet 2020

DataMelt – Computation and Visualization Environment

Data Mining 101 Tools and Techniques

Data Mining Tutorial

Data Engineering Bulletin

DataFerrett – Data Mining Tool

Data Fountains: Open Source Internet Resource Discovery and Metadata/Full-Text Generation Service

Data Mining Amazon Web Services (AWS) Big Data – Data Lakes and Analytics

Data Mining and Knowledge Discovery Journal

Data Mining and Predictive Analytics

Data Mining Applications in Transportation Engineering

Data Mining Case Study – Mining complex financial information

Data Mining Concepts

Data Mining ebook: Theories Algorithms and Examples

Data Mining for the Masses

Data Mining – Federal Efforts Cover a Wide Range of Uses Report

Data Mining Glossary

Data Mining Group (DMG)

Data Mining in Banking and its Applications

Data Mining Oil and Gas Hydrocarbon Exploration Data

Data Mining, Predictive Modeling, Business Analytics: Training, Consulting & Solutions

Data Mining Primer from Oracle

Data Mining Publications from Google

Data Mining Resources 2020

Data Mining Resources

Data Mining Resources

Data Mining Resources

Data Mining Resources at CCSU

Data Mining Table Analysis Tool

Data Mining Techiques in CRM

Data Mining: Technology and Policy The DHS Privacy Office

Data Mining: Text Mining, Visualization and Social Media

Data Mining Tools

Data Mining Tools

Data Mining, Web Scraping, Web Mining, Data Extraction and Screen Scraping Technology Links

Data Mining, Web Mining, and Business Intelligence Solutions from Salford Systems

Data Mining White Paper – Free Best Practices Guide

Data Mining White Paper from Intel – Turning Big Data Into Big Insights

Data Mining – Wikipedia

Datanami – Big Data, Big Analytics, and Big Insights


Data Science Toolkit

Datasets for Data Mining and Data Science

Data Shaping Data Mining Resources

Data Sources

Data Visualizations Derived From Data Mining Big Data

Data Warehousing and Data Mining

DbVisualizer – The Universal Database Tool

DeepDive – Analyze Data On a Deeper Level Than Ever Before

Deep Learning for Java – Open Source, Distributed, Deep Learning Library for the JVM

Deep Web Research and Discovery Resources 2020

Digital Operating Systems Tools and Resources 2019/2020

Data Warehouse, Data Mart, Data Mining and Decision Support Resources

DiscoverText – Capture Text Data and Crunch Your Data

Distributed Data Mining in Credit Card Fraud Detection

Easy Data Mining Software

Easy PDF Cloud

eBiquity Research Group Blogger

Early Canadiana Online

Elastic Web Mining Talk

ELKI: Environment for Developing KDD-Applications Supported by Index-Structures

EU Open Data Portal

Everything You Wanted to Know About Data Mining but Were Afraid to Ask by Alexander Furnas


Google BigQuery – Query Cloud Based Datasets

Google Open Refine 2.0 – Open Source Power Tool for Data Wranglers and Working With Messy Data

Great War Primary Documents Archive

History of Data Mining by Raymond Li 

Howard D. Wactlar Home Page

IBM Data Mining Cognos Business Solutions

Imagination Engines

Indiegogo Datasets

Information Retrieval (IR) and Information Extraction (IE) on the Web Using Hypertext Meta-Data and Structure

InfoVis CyberInfrastructure

International Journal of Business Intelligence and Data Mining (IJBIDM)

International Journal of Data Mining and Bioinformatics (IJDMB)

International Journal of Data Warehousing and Mining (IJDWM)

Internet Archive

Inter-university Consortium for Political and Social Research (ICPSR)

InvestigateIX Search Engine and Text Mining Toolbox

Jaspersoft® ETL – The Open Source Data Integration Platform

Junar – The Open Data Platform 

Kaggle – Go from Big Data to Big Analytics













KDnuggets: Data Mining, Web Mining, and Knowledge Discovery Guide

KEEL (Knowledge Extraction Based on Evolutionary Learning)

Kickstarter Datasets

KNIME – Konstanz Information Miner Open Source Software

Knowledge Discovery Resources 2020

Knowledge Discovery Resources 2020 Annotated White Paper Link Compilation by Marcus P. Zillman, M.S., A.M.H.A.

Knowledge Enterprise Semantic Intelligence Suite

KnowleSys – Web Public Opinion Monitoring

LingPipe – Information Extraction and Data Mining Tools

LoginWorks – Advanced Solutions – Data Mining and Web Scraping

Machine Learning from Scratch 

Mallet – MAchine Learning for LanguagE Toolkit

Marriott Library at the University of Utah Digital Collections

Marti Hearst Home Page

Media Patterns – Detecting Patterns in the Global Media Content

Megaputer – Data Mining and Text Mining Software

Microsoft® Data Mining Project – Efficient Data Exploration and Modeling

Minerazzi – Your Search-and-Mine Ecosystem

Mining Road Traffic Accident Data

Mining Spatial Data of Traffic Accidents

MIT OpenCourseWare study and certification Data Mining Discipline

MOA (Massive Online Analysis)

MoData – Big Data Resources

MonetDB Query Processing at Light Speed

Mozenda – Data Extraction and Comprehensive Web Data Gathering

National Archives, London

National Centre for Text Mining (NaCTeM)

National Science Digital Library (NSDL)

National Technical Information Service (NTIS)

Neural Networks in Data Mining 

Nesstar – Publish Data on the Web

NetOwl – Entity Extraction and Entity Analytics for Big Data

New York Public Library

Nuix – eDiscovery and Electronic Investigation Software

Observatory on Social Media (OSoMe)

Online News Archive 

OntoMiner: Bootstrapping and Populating Ontologies From Domain Specific Web Sites

Open Data Barometer

Open Data Handbook – Guides, Case Studies and Resources for Government and Civil Society On the What, Why and How of Open Data

Open Data Inception

Open Data Institute

Open Data Inventory (ODIN)

Open Data Network

Open Datasets

Open Educational Resources (OER) Sources 2020

OpenMinted – Open Service Oriented e-Infrastructure for Scientific and Scholarly Text and Data Mining

Open/Public Data Sources

Open Source Data Mining Tools

Oracle Data Mining

Oracle Knowledge base about Big Data Mining

Orange – Open Source Data Visualization and Analysis for Novice and Experts

Overview – Open Source Document Mining

PC AI Magazine Artificial Intelligence

Pentaho BI Project – Open Source Business Intelligence

PEPITe S.A. – Unlock Your Knowledge

Prediction Markets 2020

Predictive Model Markup Language (PMML)- Project Info

Predictive Model Markup Language (PMML)

Probabilistic Data Models for Web Analytics and Data Mining

Proxycrawl – Stay Anonymous While Crawling the Web

QDA Miner Lite (Freeware)

QL2 Software – Unstructured Data Management and Web Mining Software

QueryTree – Explore Data Without Code

Raghu Ramakrishnan Home Page

RapidMiner – Open Source Data Mining Tool

Rattle – Data Mining Toolkit in R – 2,000 Data Repositories

Recommended Books on Data Mining


Rexer Analytics – Analytic and CRM Consulting

Ron Kohavi Home Page

SAS – Data and Text Mining

SAS What is Data Mining

SCaVis – Scientific Computation and Visualization Environment

Scientific Data Repository – Real Time Visualization and Exploration Techniques

Screen-Scraper – Data Extraction Software and Services

Searching the Internet 2020

Semantic Scholar – Free Scientific Literature Search and Discovery

SIGKDD – ACM Special Interest Group – Knowledge Discovery in Data and Data Mining

Slideshare Presentations About Data Mining – a List

Slideshare Presentation about Data Mining

Smithsonian/NASA Astrophysics Data System (ADS)

Snorkel: A System for Fast Training Data Creation

Social Buzz Bot 2020 – Business Intelligence Data Mining for Information Discovery from Social Communities [PDF file download]

Software Suites for Data Mining, Analytics, and Knowledge Discovery

Special Interest Group – Knowledge Discovery in Data and Data Mining – SIGKDD Explorations Newsletter

SPMF – Open Source Data Mining Library

Stanford Data Mining Course cs345a course handouts

Statistical Analysis and Data Mining

Statistics Resources and Big Data 2020

Statoo Statistical Consulting + Data Analysis + Data Mining

Streaming Data Mining

Talend Open Data Solutions

Tanagra Project – Free Data Mining Software for Academic and Research Purposes

Text Data Mining

Text Mining for Scholarly Communications and Repositories

The Archaeology Data Service (ADS)

The Centre for Contemporary Canadian Art – Canadian Art Database Project

The Data Mine

The History Data Service (HDS)

The National Centre for Text Mining: Aims and Objectives by Sophia Ananiadou, Julia Chruszcz, John Keane, John McNaught and Paul Watry

The New York Times Article Search API

The Open Access Digital Library

The Ultimate Artificial Intelligence Resources Guide by Kyle Poyar

Togaware – Data Mining Resources

T-Rex (Trainable Relation Extraction)

Try Data Mining Queries Interactively Online using sample dataset

UC Irvine Machine Learning Repository

Udemy Course About Data Mining

University of Florida Digital Collections (UFDC)

University of North Texas Digital Collections

Using the Internet As a Dynamic Resource Tool for Knowledge Discovery 2019

Wallmine – Wall Street Data Mining

Web Curator Tool (WCT) – Management of Selective Web Harvesting Process

Web Data Extractors 2020 – Turn Unstructured Web Content Into Machine-Readable Data Feeds That You Can Consume On Demand

Weka 3: Data Mining Software in Java

Weka 3 – Data Mining with Open Source Machine Learning Software in Java

World Bank Datasets For Data Mining

Zentut – What is Data Mining Tutorial

Posted in: AI, Business Research, Competitive Intelligence, Data Mining, Economy, Financial System, Internet Resources, KM, Legal Research, Open Source