Oct 11, 2021
The final project will ask you to combine several of these topics/techniques to analyze a data sets and produce a web-based data visualization
# Start with the usual imports
# We'll use these throughout
import pandas as pd
import seaborn as sns
from matplotlib import pyplot as plt
Using software to gather and extract data/content from websites
RadPad scraped the entirety of Craiglist, Craigslist sued RadPad, and they were awarded $60 million
<h1>
, <h2>
tags define headers<p>
tags define paragraphs<ol>
and <ul>
are ordered and unordered listsUse the %%html
magic cell command
%%html
<html>
<head>
<title>TITLE GOES HERE</title>
</head>
<body>
<h1>MAIN CONTENT GOES IN THE BODY TAG</h1>
<p>This is a paragraph tag</p>
<p>This is a second paragraph tag</p>
</body>
</html>
Learning the notation:
%%html
<a id="my-link" style="color: orange;" href="https://www.design.upenn.edu" target="blank_">This is my link</a>
The element:
The tag:
The attributes:
id
and class
.red
#some-id
p
, li
, div
And many more: look up the syntax when you need it
Simply hit F12 to load the Web Inspector
JS
or XHR
(XML HTTP requests)Usually looking for .json
files for GeoJSON data