Linear Regression Analysis in Materials Sciences
Author(s): Benjamin Obi Tayo Ph.D. Originally published on Towards AI. Parabolic fit of the conduction band of MoS2 crystal. This code performs linear regression on simulated band structure data for MoS2 crystal. The band structure of MoS2 was calculated in a previous …
Data Science 101 — A Short Course on Medium Platform with R and Python Code Included
Author(s): Benjamin Obi Tayo Ph.D. Originally published on Towards AI. Data Science 101 is intended for individuals who have some prior exposure to or knowledge of data science concepts and are interested in practical applications beyond what is offered in most introductory-level data …
New Model for Word Embeddings which are Resilient to Misspellings (MOE)
Author(s): Edward Ma. Originally published on Towards AI. Traditional word embeddings are good at solving many natural language processing (NLP) downstream problems such as document classification and named-entity recognition (NER). However, one of the drawbacks …
DateTime Manipulations with Python
Author(s): Bindhu Balu. Originally published on Towards AI. In Python, the date, time, and datetime classes provide a number of functions for dealing with dates, times, and time intervals. Date and datetime values are objects in Python, so when you manipulate them, you …
How does Data Noising Help to Improve your NLP Model?
Author(s): Edward Ma. Originally published on Towards AI. The objective of injecting data noising. Introducing data noising to a neural network aims to improve model generalization and performance. Xie et al. propose several ways to generate more …
Training a Machine Learning Model on a Dataset with Highly-Correlated Features
Author(s): Benjamin Obi Tayo Ph.D. Originally published on Towards AI. 1. Import necessary libraries In the previous article (Feature Selection and Dimensionality Reduction Using Covariance Matrix Plot), we’ve shown that a covariance matrix plot can be used for feature selection and dimensionality …
Hands-on k-fold Cross-validation for Machine Learning Model Evaluation — Cruise Ship Dataset
Author(s): Benjamin Obi Tayo Ph.D. Originally published on Towards AI. 1. Import necessary libraries In the previous article (Feature Selection and Dimensionality Reduction Using Covariance Matrix Plot), we’ve shown that a covariance matrix plot can be used for feature selection and dimensionality …
Emoticon and Emoji in Text Mining
Author(s): Dhilip Subramanian. Originally published on Towards AI. Converting emoticons and emojis into word form using Python. In today’s online communication, emojis and emoticons are becoming the primary language that allows us to communicate with anyone globally when you need …
A Gentle Introduction to Graph Embeddings
Author(s): Edward Ma. Originally published on Towards AI. TransE. Instead of traditional machine learning classification methods, we can consider using a graph neural network (GNN) to solve node classification problems. By providing an explicit …
Random Walk in Node Embeddings (DeepWalk, node2vec, LINE, and GraphSAGE)
Author(s): Edward Ma. Originally published on Towards AI. Graph Embeddings. Instead of traditional machine learning classification methods, we can consider using a graph neural network (GNN) to solve node classification problems. By providing an …
Top Restaurant Finder Nearby
Author(s): Chittal Patel. Originally published on Towards AI. Introduction. In this project, I created a basic data science project, Top Restaurant Finder, which gives the top restaurants near your address. I did explore the …
Building a Spam Detector Using Python’s NLTK Package
Author(s): Bindhu Balu. Originally published on Towards AI. NLTK: Natural Language Toolkit. In this part, we will go through an end-to-end walkthrough of building a very simple text classifier in Python 3. Our goal is to build a predictive …
4 Graph Neural Networks you Need to Know (WLG, GCN, GAT, GIN)
Author(s): Edward Ma. Originally published on Towards AI. We went through Knowledge Graph Embeddings and Random Walk in previous graph neural network stories. Knowledge graph embeddings train entity embeddings for downstream tasks. On the …
Will Your Education Pay You Well?
Author(s): Harsh Darji. Originally published on Towards AI. Wage analysis using Random Forest. Wage analysis is the process of comparing salaries based on the attributes of the employee. Of course, there are several factors, such as the company and location, which …
Data Science: A Simple Path for Beginners
Author(s): Surya Govind. Originally published on Towards AI. How to start? Learn all you need in one year. First, what is data science? Data science is an interdisciplinary field that uses scientific methods, processes, algorithms, and …