WORKFLOW
Scrape Data from a Website
Easily extract lists and tables from webpages.
Import webpage data
Data on webpages is often stored in list (<li>...</li>) and table (<table>...</table>) elements. You can extract all of the lists and tables on a page using the "Data" element of Import.
Here is a weather forecast webpage:
data:image/s3,"s3://crabby-images/ac64a/ac64ab877af3c3da2b5570b628c54e084723d565" alt=""
Scrape all of the lists and tables on that page:
data:image/s3,"s3://crabby-images/0cf6a/0cf6ac90b5e6fc85bff253afa97d21abdb35a212" alt=""
- Use "FullData" to include empty elements in the scraped data, preserving the complete structures of lists and tables.
Extract the data you want
Pull out just the temperature data:
data:image/s3,"s3://crabby-images/3ed34/3ed34699210a409a8b82bd6f763d5467144ca87a" alt=""
Analyze the data
Plot the temperature data. The “Temp” header in the numerical data is automatically ignored:
Notes
If a URL contains data in a format other than HTML, you can often import the data directly. Here is an import of earthquake data in "CSV" format, which is inferred from the “csv” extension:
data:image/s3,"s3://crabby-images/a093f/a093fc24d54c23ff53b9de4a4c60dc38c8745b4f" alt=""