Skip to content

usf-cs272-spring2022/homework-HtmlFetcher-template

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

HtmlFetcher

Points

For this homework, you will download HTML content from web servers. This homework is broken into the following classes:

  • HttpsFetcher.java: This is a general HTTP/S fetcher and is the same as the lecture code.

  • HtmlFetcher.java: This is a variant of HTTP/S fetcher specifically for HTML content. There are a few methods here you must implement.

Hints

Below are some hints that may help with this homework assignment:

  • It will help to have a HTTP reference. The MDN Web Docs have nice HTTP reference references, including references for HTTP headers and HTTP status codes.

  • Do not fetch the entire page unless necessary! For the most efficient solution, do not directly use HttpsFetcher.fetchURL(URL url) in your implementation. Instead, setup the sockets and get the headers in the same way. Then, based on those headers, decide how to proceed.

  • Some of these methods can be done using regular expressions, but it is not required.

These hints are optional. There may be multiple approaches to solving this homework.

Requirements

See the Javadoc and TODO comments in the template code in the src/main/java directory for additional details. You must pass the tests provided in the src/test/java directory. Do not modify any of the files in the src/test directory.

See the Homework Guides for additional details on homework requirements and submission.

About

HtmlFetcher

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published