You can use linear regression for this purpose. The dataset contains information like name, age, sex, number of siblings aboard, etc of about 891 passengers in the training set and 418 passengers in the testing set.ĩ.2 Data Science Project Idea: Build a fun model to predict whether a person would have survived on the Titanic or not. On 15 April 1912, the unsinkable Titanic ship sank and killed 1502 passengers out of 2224. The algorithm that is useful for this purpose is XGboost which stands for extreme gradient boosting, it is based on decision trees.Ĩ.3 Source Code: Machine Learning Project on Detecting Parkinson’s Disease 9. The data is used to separate healthy people from people with Parkinson’s disease.Ĩ.2 Data Science Project Idea: The model can be used to differentiate healthy people from people having Parkinson’s disease. The dataset contains 195 records of people with 23 different attributes which contain biomedical measurements. Parkinson is a nervous system disorder that affects movement. Implement a linear regression model that will be used for predicting height or weight. This dataset can be used to build a model that can predict the heights or weights of a human.ħ.2 Data Science Project Idea: Build a predictive model for determining height or weight of a person. It contains only the height (inches) and weights (pounds) of 25,000 different humans of 18 years of age. The model can be used to predict wine quality.Ħ.2 Data Science Project Idea: Perform various different machine learning algorithms like regression, decision tree, random forests, etc and differentiate between the models and analyse their performances. The dataset is good for classification and regression tasks. It has 4898 instances with 14 variables each. The dataset contains different chemical information about wine. The Passive Aggressive algorithm can classify massive streams of data, it can be implemented quickly.ĥ.3 Source Code: Fake News Detection Python Project 6. The first column identifies news, second for the title, third for news text and fourth is the label TRUE or FAKE.ĥ.1 Data Link: Fake news detection datasetĥ.2 Data Science Project Idea: Build a fake news detection model with Passive Aggressive Classifier algorithm. It is a CSV file that has 7796 rows with 4 columns. Linear regression is used to predict values of unknown input when the data has some linear relationship between input and output variables. You can use this dataset to predict house prices.Ĥ.2 Data Science Project Idea: Predict the housing prices of a new house using linear regression. It has 506 rows and 14 different variables in columns. It contains information about the different houses in Boston based on crime rate, tax, number of rooms, etc. This is a popular dataset used in pattern recognition. This is a perfect dataset to start implementing image classification where you can classify a digit from 0 to 9.ģ.2 Data Science Project Idea: Implement a machine learning classification algorithm on image to recognize handwritten digits from a paper.ģ.3 Source Code: Handwritten Digit Recognition with Deep Learning 4. It contains 60,000 training images and 10,000 testing images. This is a database of handwritten digits. Classification is the task of separating items into its corresponding class. The dataset has 3 classes with 50 instances in each class, therefore, it contains 150 rows with only 4 columns.Ģ.2 Data Science Project Idea: Implement a machine learning classification or regression model on the dataset. The iris dataset is a simple and beginner-friendly dataset that contains information about the flower petal and sepal sizes. It is useful in customised marketing.ġ.3 Source Code: Customer Segmentation Project with Machine Learning 2. Customer segmentation is an important practise of dividing customers base into individual groups that are similar. It collects insights from the data and group customers based on their behaviors.ġ.2 Data Science Project Idea: Segment the customers based on the age, gender, interest. The dataset has gender, customer id, age, annual income, and spending score. The Mall customers dataset contains information about people visiting the mall. Join DataFlair on Telegram!! Machine Learning Datasets for Data Science Beginners Stay updated with latest technology trends
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |