I wanted to share some tips on quickly preparing for the TOEFL iBT test
I had to take TOEFL on a short notice, I had two weeks to take the exam and had prepared using materials that I will share in this post
It helped me get a decent score (106/120) and I wanted to share those incredible materials that helped me with everyone who are looking to ace the exam in a short span of time.
Machine Learning Flashcards from Twitter -- Part 2 Data Analysis and Download
This is the analysis part where we do a small analysis to find
- Which are the most important/popular tweets
- Whether older materials covered important concepts than recent tweets
Load the necessary libs
%load_ext autoreload
%autoreload 2
%matplotlib inline
import pandas as pd
pd.set_option("display.width", 150)
import requests
import re
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
from pathlib import Path
Load csv
df = pd.read_csv("chrisalbon_mlflashcards.csv")
df['text'].fillna('', inplace=True)
df.tail()
id likes replies retweets text timestamp url img_url
236 94607825069... 19 1 0 Bayes Error 2017-12-27T... https://twi... https://pbs...
237 94575108497... 47 3 13 Occams Razor 2017-12-26T... https://twi... https://pbs...
238 94571793723... 8 0 1 K-Fold Cros... 2017-12-26T... https://twi... https://pbs...
239 94538342129... 18 1 1 Extrema 2017-12-25T... https://twi... https://pbs...
240 94536381783... 34 1 7 Softmax Act... 2017-12-25T... https://twi... https://pbs...
Check for missing values
df.count(axis=0)
id 241
likes 241
replies 241
retweets 241
text 241
timestamp 241
url 241
img_url 237
dtype: int64
We see we have few img_urls missing. Let’s check what they are
[Read More]Machine Learning Flashcards from Twitter -- Part 1 Data Collection and Preprocessing
I was searching the net for mlflashcards, I found this incredible machine learning flashcard tweet series from Chris Albon. It looks pretty and covers a lot of ground, Got a thought – why not download them for later use? I thought it would be a fun exercise to start the weekend and jumped into action.
Step 1 – Collect/Scrape data from twitter
I evaluated using twitter api using tweetpy, but it has its own limitation aka we can search only a week worth of data which is not good for our use case. We shoud be able to get data spread across months since the tweets we are interested are spread across a wide time range.
[Read More]