Skip to content

Tokenization is preliminary. Need to extract information from DOM. #1

@DaviesX

Description

@DaviesX

From @DaviesX on February 24, 2017 6:10

A good tool for DOM parsing in C++ is "Xerces-C++"
https://xerces.apache.org/xerces-c/apiDocs-3/annotated.html

Current parsing result:

********* Start testing of UnitTest *********
Config: Using QtTest library 5.6.1, Qt 5.6.1 (x86_64-little_endian-lp64 shared (dynamic) release build; by GCC 6.2.0 20160914)
PASS   : UnitTest::initTestCase()
Processing new document:
Token: Term=[<HTML>,0,0,0,1]
Token: Term=[<HEAD>,0,0,1,1]
Token: Term=[<META,0,0,2,1]
Token: Term=[HTTP-EQUIV="Context-Type",0,0,3,1]
Token: Term=[CONTEXT="text/html;charset=windows-1252">,0,0,4,1]
Token: Term=[<meta,0,0,5,1]
Token: Term=[name="GENERATOR",0,0,6,1]
Token: Term=[content="Microsoft,0,0,7,1]
Token: Term=[Internet,0,0,8,1]
Token: Term=[Assistant,0,0,9,1]
Token: Term=[for,0,0,10,1]
Token: Term=[PowerPoint,0,0,11,1]
Token: Term=[97">,0,0,12,1]
Token: Term=[<TITLE>Learning,0,0,13,1]
Token: Term=[a,0,0,14,1]
Token: Term=[clause</TITLE>,0,0,15,1]
Token: Term=[</HEAD>,0,0,16,1]
Token: Term=[<BODY,0,0,17,1]
Token: Term=[>,0,0,18,1]
Token: Term=[<H1>Learning,0,0,19,1]
Token: Term=[a,0,0,20,1]
Token: Term=[clause</H1>,0,0,21,1]
Token: Term=[<P><UL>,0,0,22,1]
Token: Term=[<LI><H2>State,0,0,23,1]
Token: Term=[Representation:,0,0,24,1]
Token: Term=[</H2>,0,0,25,1]
Token: Term=[</UL><UL>,0,0,26,1]
Token: Term=[<LI><H2>Set,0,0,27,1]
Token: Term=[of,0,0,28,1]
Token: Term=[Positive,0,0,29,1]
Token: Term=[Tuples,0,0,30,1]
Token: Term=[(Po),0,0,31,1]
Token: Term=[</H2>,0,0,32,1]
Token: Term=[<UL>,0,0,33,1]
Token: Term=[<LI>Set,0,0,34,1]
Token: Term=[of,0,0,35,1]
Token: Term=[Negative,0,0,36,1]
Token: Term=[Tuples,0,0,37,1]
Token: Term=[(No),0,0,38,1]
Token: Term=[<LI>Body,0,0,39,1]
Token: Term=[(Conjunction,0,0,40,1]
Token: Term=[of,0,0,41,1]
Token: Term=[Literals),0,0,42,1]
Token: Term=[Initialized,0,0,43,1]
Token: Term=[to,0,0,44,1]
Token: Term=[True,0,0,45,1]
Token: Term=[<LI>Old,0,0,46,1]
Token: Term=[Variables,0,0,47,1]
Token: Term=[</UL></UL><UL>,0,0,48,1]
Token: Term=[<LI><H2>Operators:Add,0,0,49,1]
Token: Term=[a,0,0,50,1]
Token: Term=[new,0,0,51,1]
Token: Term=[literal,0,0,52,1]
Token: Term=[Preds*(vars+arity-1)*arity,0,0,53,1]
Token: Term=[</H2>,0,0,54,1]
Token: Term=[<UL>,0,0,55,1]
Token: Term=[<LI>Using,0,0,56,1]
Token: Term=[every,0,0,57,1]
Token: Term=[predicate,,0,0,58,1]
Token: Term=[and,0,0,59,1]
Token: Term=[every,0,0,60,1]
Token: Term=[way,0,0,61,1]
Token: Term=[of,0,0,62,1]
Token: Term=[inserting,0,0,63,1]
Token: Term=[New,0,0,64,1]
Token: Term=[and,0,0,65,1]
Token: Term=[Old,0,0,66,1]
Token: Term=[Variables,0,0,67,1]
Token: Term=[<LI>Produces,0,0,68,1]
Token: Term=[a,0,0,69,1]
Token: Term=[Set,0,0,70,1]
Token: Term=[of,0,0,71,1]
Token: Term=[Positive,0,0,72,1]
Token: Term=[Tuples,0,0,73,1]
Token: Term=[(P1),,0,0,74,1]
Token: Term=[Negative,0,0,75,1]
Token: Term=[Tuples,0,0,76,1]
Token: Term=[(N1),,0,0,77,1]
Token: Term=[Body,,0,0,78,1]
Token: Term=[Variables,0,0,79,1]
Token: Term=[</UL></UL><UL>,0,0,80,1]
Token: Term=[<LI><H2>Evaluation</H2>,0,0,81,1]
Token: Term=[</UL></P>,0,0,82,1]
Token: Term=[<P></P>,0,0,83,1]
Token: Term=[<P>,0,0,84,1]
Token: Term=[<TABLE>,0,0,85,1]
Token: Term=[<TD,0,0,86,1]
Token: Term=[HEIGHT=100,0,0,87,1]
Token: Term=[WIDTH=100>,0,0,88,1]
Token: Term=[<A,0,0,89,1]
Token: Term=[HREF="tsld014.htm">Previous,0,0,90,1]
Token: Term=[slide</A>,0,0,91,1]
Token: Term=[</TD>,0,0,92,1]
Token: Term=[<TD,0,0,93,1]
Token: Term=[HEIGHT=100,0,0,94,1]
Token: Term=[WIDTH=100>,0,0,95,1]
Token: Term=[<A,0,0,96,1]
Token: Term=[HREF="tsld016.htm">Next,0,0,97,1]
Token: Term=[slide</A>,0,0,98,1]
Token: Term=[</TD>,0,0,99,1]
Token: Term=[<TD,0,0,100,1]
Token: Term=[HEIGHT=100,0,0,101,1]
Token: Term=[WIDTH=150>,0,0,102,1]
Token: Term=[<A,0,0,103,1]
Token: Term=[HREF="tsld001.htm">Back,0,0,104,1]
Token: Term=[to,0,0,105,1]
Token: Term=[first,0,0,106,1]
Token: Term=[slide</A>,0,0,107,1]
Token: Term=[</TD>,0,0,108,1]
Token: Term=[<TD,0,0,109,1]
Token: Term=[HEIGHT=100,0,0,110,1]
Token: Term=[WIDTH=150>,0,0,111,1]
Token: Term=[<A,0,0,112,1]
Token: Term=[HREF="sld015.htm">View,0,0,113,1]
Token: Term=[graphic,0,0,114,1]
Token: Term=[version</A>,0,0,115,1]
Token: Term=[</TD>,0,0,116,1]
Token: Term=[</TABLE>,0,0,117,1]
Token: Term=[<BR>,0,0,118,1]
Token: Term=[</P>,0,0,119,1]
Token: Term=[</Body>,0,0,120,1]
Token: Term=[</HTML>,0,0,121,1]
Token: Term=[,0,0,122,1]
Processing new document:
Token: Term=[<html>,0,0,0,1]
Token: Term=[<head>,0,0,1,1]
Token: Term=[<title>CS,0,0,2,1]
Token: Term=[266,0,0,3,1]
Token: Term=[-,0,0,4,1]
Token: Term=[Computational,0,0,5,1]
Token: Term=[Geometry,0,0,6,1]
Token: Term=[Homework,0,0,7,1]
Token: Term=[6</title>,0,0,8,1]
Token: Term=[</head>,0,0,9,1]
Token: Term=[<body,0,0,10,1]
Token: Term=[bgcolor="#FFFFFF">,0,0,11,1]
Token: Term=[<center>,0,0,12,1]
Token: Term=[<h2>CS,0,0,13,1]
Token: Term=[266,0,0,14,1]
Token: Term=[-,0,0,15,1]
Token: Term=[Computational,0,0,16,1]
Token: Term=[Geometry,0,0,17,1]
Token: Term=[Homework,0,0,18,1]
Token: Term=[6,,0,0,19,1]
Token: Term=[50,0,0,20,1]
Token: Term=[Points,0,0,21,1]
Token: Term=[<br>,0,0,22,1]
Token: Term=[Due:,0,0,23,1]
Token: Term=[Monday,,0,0,24,1]
Token: Term=[February,0,0,25,1]
Token: Term=[29,,0,0,26,1]
Token: Term=[11:55pm</h2>,0,0,27,1]
Token: Term=[<br>,0,0,28,1]
Token: Term=[Assignments,0,0,29,1]
Token: Term=[must,0,0,30,1]
Token: Term=[be,0,0,31,1]
Token: Term=[typed,0,0,32,1]
Token: Term=[and,0,0,33,1]
Token: Term=[turned,0,0,34,1]
Token: Term=[in,0,0,35,1]
Token: Term=[using,0,0,36,1]
Token: Term=[the,0,0,37,1]
Token: Term=[<a,0,0,38,1]
Token: Term=[href="https://eee.uci.edu/">EEE</a>,0,0,39,1]
Token: Term=[system.</h2>,0,0,40,1]
Token: Term=[</center>,0,0,41,1]
Token: Term=[<p>,0,0,42,1]
Token: Term=[<ol>,0,0,43,1]
Token: Term=[<li>,0,0,44,1]
Token: Term=[<i>10,0,0,45,1]
Token: Term=[points.</i>,0,0,46,1]
Token: Term=[Suppose,0,0,47,1]
Token: Term=[you,0,0,48,1]
Token: Term=[are,0,0,49,1]
Token: Term=[given,0,0,50,1]
Token: Term=[two,0,0,51,1]
Token: Term=[sets,,0,0,52,1]
Token: Term=[A,0,0,53,1]
Token: Term=[and,0,0,54,1]
Token: Term=[B,,0,0,55,1]
Token: Term=[of,0,0,56,1]
Token: Term=[n,0,0,57,1]
Token: Term=[points,0,0,58,1]
Token: Term=[each.,0,0,59,1]
Token: Term=[Describe,0,0,60,1]
Token: Term=[an,0,0,61,1]
Token: Term=[O(n,0,0,62,1]
Token: Term=[log,0,0,63,1]
Token: Term=[n)-time,0,0,64,1]
Token: Term=[algorithm,0,0,65,1]
Token: Term=[to,0,0,66,1]
Token: Term=[find,0,0,67,1]
Token: Term=[the,0,0,68,1]
Token: Term=[nearest,0,0,69,1]
Token: Term=[neighbor,0,0,70,1]
Token: Term=[in,0,0,71,1]
Token: Term=[B,0,0,72,1]
Token: Term=[for,0,0,73,1]
Token: Term=[each,0,0,74,1]
Token: Term=[point,0,0,75,1]
Token: Term=[in,0,0,76,1]
Token: Term=[A.,0,0,77,1]
Token: Term=[<li>,0,0,78,1]
Token: Term=[<i>10,0,0,79,1]
Token: Term=[points.</i>,0,0,80,1]
Token: Term=[Problem,0,0,81,1]
Token: Term=[7.5,0,0,82,1]
Token: Term=[from,0,0,83,1]
Token: Term=[de,0,0,84,1]
Token: Term=[Berg,0,0,85,1]
Token: Term=[et,0,0,86,1]
Token: Term=[al.,0,0,87,1]
Token: Term=[<li>,0,0,88,1]
Token: Term=[<i>10,0,0,89,1]
Token: Term=[points.</i>,0,0,90,1]
Token: Term=[Problem,0,0,91,1]
Token: Term=[7.7,0,0,92,1]
Token: Term=[from,0,0,93,1]
Token: Term=[de,0,0,94,1]
Token: Term=[Berg,0,0,95,1]
Token: Term=[et,0,0,96,1]
Token: Term=[al.,0,0,97,1]
Token: Term=[<li>,0,0,98,1]
Token: Term=[<i>10,0,0,99,1]
Token: Term=[points.</i>,0,0,100,1]
Token: Term=[Problem,0,0,101,1]
Token: Term=[7.11,0,0,102,1]
Token: Term=[from,0,0,103,1]
Token: Term=[de,0,0,104,1]
Token: Term=[Berg,0,0,105,1]
Token: Term=[et,0,0,106,1]
Token: Term=[al.,0,0,107,1]
Token: Term=[<li>,0,0,108,1]
Token: Term=[<i>10,0,0,109,1]
Token: Term=[points.</i>,0,0,110,1]
Token: Term=[Problem,0,0,111,1]
Token: Term=[7.12,0,0,112,1]
Token: Term=[from,0,0,113,1]
Token: Term=[de,0,0,114,1]
Token: Term=[Berg,0,0,115,1]
Token: Term=[et,0,0,116,1]
Token: Term=[al.,0,0,117,1]
Token: Term=[</ol>,0,0,118,1]
Token: Term=[</body>,0,0,119,1]
Token: Term=[</html>,0,0,120,1]
Token: Term=[,0,0,121,1]
Processing new document:
Token: Term=[<!DOCTYPE,0,0,0,1]
Token: Term=[HTML,0,0,1,1]
Token: Term=[PUBLIC,0,0,2,1]
Token: Term=["-//IETF//DTD,0,0,3,1]
Token: Term=[HTML,0,0,4,1]
Token: Term=[2.0//EN">,0,0,5,1]
Token: Term=[<HTML>,0,0,6,1]
Token: Term=[<HEAD>,0,0,7,1]
Token: Term=[<META,0,0,8,1]
Token: Term=[HTTP-EQUIV="GENERATOR",0,0,9,1]
Token: Term=[CONTENT="Globetrotter,0,0,10,1]
Token: Term=[1.1.1">,0,0,11,1]
Token: Term=[<META,0,0,12,1]
Token: Term=[HTTP-EQUIV="AUTHOR",0,0,13,1]
Token: Term=[CONTENT="David,0,0,14,1]
Token: Term=[G.,0,0,15,1]
Token: Term=[Kay">,0,0,16,1]
Token: Term=[<META,0,0,17,1]
Token: Term=[HTTP-EQUIV="UPDATED",0,0,18,1]
Token: Term=[CONTENT="Monday,,0,0,19,1]
Token: Term=[June,0,0,20,1]
Token: Term=[23,,0,0,21,1]
Token: Term=[2003,0,0,22,1]
Token: Term=[6:10,0,0,23,1]
Token: Term=[PM">,0,0,24,1]
Token: Term=[<TITLE>Editing,0,0,25,1]
Token: Term=[the,0,0,26,1]
Token: Term=[R&eacute;sum&eacute;,0,0,27,1]
Token: Term=[and,0,0,28,1]
Token: Term=[Cover,0,0,29,1]
Token: Term=[Letter</TITLE>,0,0,30,1]
Token: Term=[<META,0,0,31,1]
Token: Term=[HTTP-EQUIV="X-GLOBETROTTERDATA",0,0,32,1]
Token: Term=[CONTENT="93248D94">,0,0,33,1]
Token: Term=[<META,0,0,34,1]
Token: Term=[HTTP-EQUIV=KEYWORDS,0,0,35,1]
Token: Term=[CONTENT="Peer,0,0,36,1]
Token: Term=[editing,,0,0,37,1]
Token: Term=[writing,0,0,38,1]
Token: Term=[assignments,,0,0,39,1]
Token: Term=[resume,,0,0,40,1]
Token: Term=[cover,0,0,41,1]
Token: Term=[letter">,0,0,42,1]
Token: Term=[<META,0,0,43,1]
Token: Term=[HTTP-EQUIV="DESCRIPTION",0,0,44,1]
Token: Term=[CONTENT="Peer,0,0,45,1]
Token: Term=[editing,0,0,46,1]
Token: Term=[guidelines,0,0,47,1]
Token: Term=[for,0,0,48,1]
Token: Term=[writing,0,0,49,1]
Token: Term=[a,0,0,50,1]
Token: Term=[r&eacute;sum&eacute;,0,0,51,1]
Token: Term=[and,0,0,52,1]
Token: Term=[cover,0,0,53,1]
Token: Term=[letter,0,0,54,1]
Token: Term=[in,0,0,55,1]
Token: Term=[ICS,0,0,56,1]
Token: Term=[139W,,0,0,57,1]
Token: Term=[Communication,0,0,58,1]
Token: Term=[Skills,0,0,59,1]
Token: Term=[for,0,0,60,1]
Token: Term=[Computer,0,0,61,1]
Token: Term=[Scientists,,0,0,62,1]
Token: Term=[an,0,0,63,1]
Token: Term=[upper,0,0,64,1]
Token: Term=[division,0,0,65,1]
Token: Term=[writing,0,0,66,1]
Token: Term=[course,0,0,67,1]
Token: Term=[in,0,0,68,1]
Token: Term=[the,0,0,69,1]
Token: Term=[Department,0,0,70,1]
Token: Term=[of,0,0,71,1]
Token: Term=[Information,0,0,72,1]
Token: Term=[and,0,0,73,1]
Token: Term=[Computer,0,0,74,1]
Token: Term=[Science,,0,0,75,1]
Token: Term=[UC,0,0,76,1]
Token: Term=[Irvine.">,0,0,77,1]
Token: Term=[<META,0,0,78,1]
Token: Term=[HTTP-EQUIV="COPYRIGHT",0,0,79,1]
Token: Term=[CONTENT="Copyright,0,0,80,1]
Token: Term=[&#169;,0,0,81,1]
Token: Term=[2000,0,0,82,1]
Token: Term=[by,0,0,83,1]
Token: Term=[David,0,0,84,1]
Token: Term=[G.,0,0,85,1]
Token: Term=[Kay.,0,0,86,1]
Token: Term=[All,0,0,87,1]
Token: Term=[rights,0,0,88,1]
Token: Term=[reserved.">,0,0,89,1]
Token: Term=[<link,0,0,90,1]
Token: Term=[href="/~kay/courses/139w/mainstyle.css",0,0,91,1]
Token: Term=[rel="stylesheet",0,0,92,1]
Token: Term=[type="text/css",0,0,93,1]
Token: Term=[/>,0,0,94,1]
Token: Term=[</HEAD>,0,0,95,1]
Token: Term=[<BODY,0,0,96,1]
Token: Term=[BGCOLOR="#FFFFFF">,0,0,97,1]
Token: Term=[<P><font,0,0,98,1]
Token: Term=[>Spring,0,0,99,1]
Token: Term=[2013,0,0,100,1]
Token: Term=[&mdash;,0,0,101,1]
Token: Term=[<a,0,0,102,1]
Token: Term=[href="http://www.uci.edu/">UC,0,0,103,1]
Token: Term=[Irvine</a>,0,0,104,1]
Token: Term=[&mdash;,0,0,105,1]
Token: Term=[<a,0,0,106,1]
Token: Term=[href="http://www.ics.uci.edu/">Information,0,0,107,1]
Token: Term=[&amp;,0,0,108,1]
Token: Term=[Computer,0,0,109,1]
Token: Term=[Science</a>,0,0,110,1]
Token: Term=[&mdash;,0,0,111,1]
Token: Term=[<a,0,0,112,1]
Token: Term=[href="http://www.ics.uci.edu/~kay/courses/139w/">ICS,0,0,113,1]
Token: Term=[139W</a>,0,0,114,1]
Token: Term=[&mdash;,0,0,115,1]
Token: Term=[<a,0,0,116,1]
Token: Term=[href="http://www.ics.uci.edu/~kay/">David,0,0,117,1]
Token: Term=[G.,0,0,118,1]
Token: Term=[Kay</a></font></P>,0,0,119,1]
Token: Term=[<P><font,0,0,120,1]
Token: Term=[size="5",0,0,121,1]
Token: Term=[><strong>R&eacute;sum&eacute;,0,0,122,1]
Token: Term=[and,0,0,123,1]
Token: Term=[Cover,0,0,124,1]
Token: Term=[Letter:,0,0,125,1]
Token: Term=[</strong>Peer,0,0,126,1]
Token: Term=[Editing,0,0,127,1]
Token: Term=[Guidelines</font></P>,0,0,128,1]
Token: Term=[<P>,0,0,129,1]
Token: Term=[<FONT,0,0,130,1]
Token: Term=[SIZE=4>Because,0,0,131,1]
Token: Term=[this,0,0,132,1]
Token: Term=[assignment,0,0,133,1]
Token: Term=[is,0,0,134,1]
Token: Term=[very,0,0,135,1]
Token: Term=[short,,0,0,136,1]
Token: Term=[you,0,0,137,1]
Token: Term=[should,0,0,138,1]
Token: Term=[have,0,0,139,1]
Token: Term=[<I>at,0,0,140,1]
Token: Term=[least,0,0,141,1]
Token: Term=[two,0,0,142,1]
Token: Term=[of,0,0,143,1]
Token: Term=[your,0,0,144,1]
Token: Term=[classmates</I>,0,0,145,1]
Token: Term=[edit,0,0,146,1]
Token: Term=[it,0,0,147,1]
Token: Term=[(and,0,0,148,1]
Token: Term=[you,0,0,149,1]
Token: Term=[should,0,0,150,1]
Token: Term=[edit,0,0,151,1]
Token: Term=[two,0,0,152,1]
Token: Term=[of,0,0,153,1]
Token: Term=[your,0,0,154,1]
Token: Term=[classmates&#39;,0,0,155,1]
Token: Term=[papers).,0,0,156,1]
Token: Term=[Try,0,0,157,1]
Token: Term=[to,0,0,158,1]
Token: Term=[work,0,0,159,1]
Token: Term=[with,0,0,160,1]
Token: Term=[people,0,0,161,1]
Token: Term=[you,0,0,162,1]
Token: Term=[haven&#39;t,0,0,163,1]
Token: Term=[worked,0,0,164,1]
Token: Term=[with,0,0,165,1]
Token: Term=[before.,0,0,166,1]
Token: Term=[(There&#39;s,0,0,167,1]
Token: Term=[a,0,0,168,1]
Token: Term=[separate,0,0,169,1]
Token: Term=[page,0,0,170,1]
Token: Term=[of,0,0,171,1]
Token: Term=[guidelines,0,0,172,1]
Token: Term=[for,0,0,173,1]
Token: Term=[the,0,0,174,1]
Token: Term=[promotion,0,0,175,1]
Token: Term=[piece;,0,0,176,1]
Token: Term=[consult,0,0,177,1]
Token: Term=[that,0,0,178,1]
Token: Term=[one,0,0,179,1]
Token: Term=[if,0,0,180,1]
Token: Term=[you&#39;re,0,0,181,1]
Token: Term=[doing,0,0,182,1]
Token: Term=[that,0,0,183,1]
Token: Term=[alternative.),0,0,184,1]
Token: Term=[</FONT></P>,0,0,185,1]
Token: Term=[<OL,0,0,186,1]
Token: Term=[TYPE="I">,0,0,187,1]
Token: Term=[<LI>,0,0,188,1]
Token: Term=[<FONT,0,0,189,1]
Token: Term=[SIZE=4>Talk,0,0,190,1]
Token: Term=[to,0,0,191,1]
Token: Term=[the,0,0,192,1]
Token: Term=[author.,0,0,193,1]
Token: Term=[What,0,0,194,1]
Token: Term=[kind,0,0,195,1]
Token: Term=[of,0,0,196,1]
Token: Term=[job,0,0,197,1]
Token: Term=[does,0,0,198,1]
Token: Term=[the,0,0,199,1]
Token: Term=[author,0,0,200,1]
Token: Term=[want?,0,0,201,1]
Token: Term=[What,0,0,202,1]
Token: Term=[does,0,0,203,1]
Token: Term=[the,0,0,204,1]
Token: Term=[author,0,0,205,1]
Token: Term=[think,0,0,206,1]
Token: Term=[are,0,0,207,1]
Token: Term=[his,0,0,208,1]
Token: Term=[or,0,0,209,1]
Token: Term=[her,0,0,210,1]
Token: Term=[strongest,0,0,211,1]
Token: Term=[points,0,0,212,1]
Token: Term=[or,0,0,213,1]
Token: Term=[best,0,0,214,1]
Token: Term=[qualifications?,0,0,215,1]
Token: Term=[weakest,0,0,216,1]
Token: Term=[points,0,0,217,1]
Token: Term=[or,0,0,218,1]
Token: Term=[shortcomings?</FONT></LI>,0,0,219,1]
Token: Term=[<BR>,0,0,220,1]
Token: Term=[<LI>,0,0,221,1]
Token: Term=[<FONT,0,0,222,1]
Token: Term=[SIZE=4>Read,0,0,223,1]
Token: Term=[your,0,0,224,1]
Token: Term=[classmate&#39;s,0,0,225,1]
Token: Term=[work,0,0,226,1]
Token: Term=[once,0,0,227,1]
Token: Term=[through,0,0,228,1]
Token: Term=[without,0,0,229,1]
Token: Term=[making,0,0,230,1]
Token: Term=[any,0,0,231,1]
Token: Term=[comments.,0,0,232,1]
Token: Term=[Then,,0,0,233,1]
Token: Term=[write,0,0,234,1]
Token: Term=[down,0,0,235,1]
Token: Term=[briefly,0,0,236,1]
Token: Term=[your,0,0,237,1]
Token: Term=[first,0,0,238,1]
Token: Term=[impressions:</FONT></LI>,0,0,239,1]
Token: Term=[<BR>,0,0,240,1]
Token: Term=[<OL,0,0,241,1]
Token: Term=[TYPE="A">,0,0,242,1]
Token: Term=[<LI>,0,0,243,1]
Token: Term=[<FONT,0,0,244,1]
Token: Term=[SIZE=4>Are,0,0,245,1]
Token: Term=[both,0,0,246,1]
Token: Term=[the,0,0,247,1]
Token: Term=[letter,0,0,248,1]
Token: Term=[and,0,0,249,1]
Token: Term=[the,0,0,250,1]
Token: Term=[r&eacute;sum&eacute;,0,0,251,1]
Token: Term=[clean,,0,0,252,1]
Token: Term=[clear,,0,0,253,1]
Token: Term=[professional,,0,0,254,1]
Token: Term=[and,0,0,255,1]
Token: Term=[perfectly,0,0,256,1]
Token: Term=[correct?</FONT></LI>,0,0,257,1]
Token: Term=[<BR>,0,0,258,1]
Token: Term=[<LI>,0,0,259,1]
Token: Term=[<FONT,0,0,260,1]
Token: Term=[SIZE=4>Does,0,0,261,1]
Token: Term=[it,0,0,262,1]
Token: Term=[contain,0,0,263,1]
Token: Term=[anything,0,0,264,1]
Token: Term=[alienating,0,0,265,1]
Token: Term=[or,0,0,266,1]
Token: Term=[off-putting?</FONT></LI>,0,0,267,1]
Token: Term=[<BR>,0,0,268,1]
Token: Term=[<LI>,0,0,269,1]
Token: Term=[<FONT,0,0,270,1]
Token: Term=[SIZE=4>Would,0,0,271,1]
Token: Term=[it,0,0,272,1]
Token: Term=[make,0,0,273,1]
Token: Term=[you,0,0,274,1]
Token: Term=[want,0,0,275,1]
Token: Term=[to,0,0,276,1]
Token: Term=[hire,0,0,277,1]
Token: Term=[the,0,0,278,1]
Token: Term=[author,0,0,279,1]
Token: Term=[if,0,0,280,1]
Token: Term=[you,0,0,281,1]
Token: Term=[were,0,0,282,1]
Token: Term=[hiring,0,0,283,1]
Token: Term=[people,0,0,284,1]
Token: Term=[for,0,0,285,1]
Token: Term=[the,0,0,286,1]
Token: Term=[kind,0,0,287,1]
Token: Term=[of,0,0,288,1]
Token: Term=[job,0,0,289,1]
Token: Term=[the,0,0,290,1]
Token: Term=[author,0,0,291,1]
Token: Term=[wants?</FONT></LI>,0,0,292,1]
Token: Term=[<BR>,0,0,293,1]
Token: Term=[</OL>,0,0,294,1]
Token: Term=[<LI>,0,0,295,1]
Token: Term=[<FONT,0,0,296,1]
Token: Term=[SIZE=4>Read,0,0,297,1]
Token: Term=[it,0,0,298,1]
Token: Term=[again,,0,0,299,1]
Token: Term=[more,0,0,300,1]
Token: Term=[carefully,,0,0,301,1]
Token: Term=[making,0,0,302,1]
Token: Term=[specific,0,0,303,1]
Token: Term=[comments,0,0,304,1]
Token: Term=[in,0,0,305,1]
Token: Term=[the,0,0,306,1]
Token: Term=[margins.,0,0,307,1]
Token: Term=[</FONT></LI>,0,0,308,1]
Token: Term=[<BR>,0,0,309,1]
Token: Term=[<OL,0,0,310,1]
Token: Term=[TYPE="A">,0,0,311,1]
Token: Term=[<LI>,0,0,312,1]
Token: Term=[<FONT,0,0,313,1]
Token: Term=[SIZE=4>Consider,0,0,314,1]
Token: Term=[the,0,0,315,1]
Token: Term=[r&eacute;sum&eacute;:</FONT></LI>,0,0,316,1]
Token: Term=[<BR>,0,0,317,1]
Token: Term=[<OL,0,0,318,1]
Token: Term=[TYPE="1">,0,0,319,1]
Token: Term=[<LI>,0,0,320,1]
Token: Term=[<FONT,0,0,321,1]
Token: Term=[SIZE=4>Does,0,0,322,1]
Token: Term=[it,0,0,323,1]
Token: Term=[include,0,0,324,1]
Token: Term=[all,0,0,325,1]
Token: Term=[the,0,0,326,1]
Token: Term=[author&#39;s,0,0,327,1]
Token: Term=[appropriate,0,0,328,1]
Token: Term=[qualifications?</FONT></LI>,0,0,329,1]
Token: Term=[<BR>,0,0,330,1]
Token: Term=[<LI>,0,0,331,1]
Token: Term=[<FONT,0,0,332,1]
Token: Term=[SIZE=4>Does,0,0,333,1]
Token: Term=[it,0,0,334,1]
Token: Term=[indicate,0,0,335,1]
Token: Term=[any,0,0,336,1]
Token: Term=[gaps,0,0,337,1]
Token: Term=[or,0,0,338,1]
Token: Term=[other,0,0,339,1]
Token: Term=[areas,0,0,340,1]
Token: Term=[that,0,0,341,1]
Token: Term=[need,0,0,342,1]
Token: Term=[more,0,0,343,1]
Token: Term=[explanation?</FONT></LI>,0,0,344,1]
Token: Term=[<BR>,0,0,345,1]
Token: Term=[<LI>,0,0,346,1]
Token: Term=[<FONT,0,0,347,1]
Token: Term=[SIZE=4>Does,0,0,348,1]
Token: Term=[it,0,0,349,1]
Token: Term=[include,0,0,350,1]
Token: Term=[any,0,0,351,1]
Token: Term=[inappropriate,0,0,352,1]
Token: Term=[material?</FONT></LI>,0,0,353,1]
Token: Term=[<BR>,0,0,354,1]
Token: Term=[<LI>,0,0,355,1]
Token: Term=[<FONT,0,0,356,1]
Token: Term=[SIZE=4>Does,0,0,357,1]
Token: Term=[it,0,0,358,1]
Token: Term=[use,0,0,359,1]
Token: Term=[consistent,,0,0,360,1]
Token: Term=[parallel,0,0,361,1]
Token: Term=[language?</FONT></LI>,0,0,362,1]
Token: Term=[<BR>,0,0,363,1]
Token: Term=[<LI>,0,0,364,1]
Token: Term=[<FONT,0,0,365,1]
Token: Term=[SIZE=4>Does,0,0,366,1]
Token: Term=[it,0,0,367,1]
Token: Term=[list,0,0,368,1]
Token: Term=[concrete,0,0,369,1]
Token: Term=[accomplishments,0,0,370,1]
Token: Term=[for,0,0,371,1]
Token: Term=[each,0,0,372,1]
Token: Term=[position,0,0,373,1]
Token: Term=[held,0,0,374,1]
Token: Term=[(where,0,0,375,1]
Token: Term=[appropriate,0,0,376,1]
Token: Term=[and,0,0,377,1]
Token: Term=[available)?</FONT></LI>,0,0,378,1]
Token: Term=[<BR>,0,0,379,1]
Token: Term=[<LI>,0,0,380,1]
Token: Term=[<FONT,0,0,381,1]
Token: Term=[SIZE=4>Does,0,0,382,1]
Token: Term=[it,0,0,383,1]
Token: Term=[give,0,0,384,1]
Token: Term=[indications,,0,0,385,1]
Token: Term=[where,0,0,386,1]
Token: Term=[applicable,,0,0,387,1]
Token: Term=[of,0,0,388,1]
Token: Term=[good,0,0,389,1]
Token: Term=[communication,0,0,390,1]
Token: Term=[skills,,0,0,391,1]
Token: Term=[of,0,0,392,1]
Token: Term=[the,0,0,393,1]
Token: Term=[ability,0,0,394,1]
Token: Term=[to,0,0,395,1]
Token: Term=[work,0,0,396,1]
Token: Term=[with,0,0,397,1]
Token: Term=[others,,0,0,398,1]
Token: Term=[of,0,0,399,1]
Token: Term=[leadership,0,0,400,1]
Token: Term=[skills?</FONT></LI>,0,0,401,1]
Token: Term=[<BR>,0,0,402,1]
Token: Term=[<LI>,0,0,403,1]
Token: Term=[<FONT,0,0,404,1]
Token: Term=[SIZE=4>Can,0,0,405,1]
Token: Term=[you,0,0,406,1]
Token: Term=[list,0,0,407,1]
Token: Term=[three,0,0,408,1]
Token: Term=[ways,0,0,409,1]
Token: Term=[in,0,0,410,1]
Token: Term=[which,0,0,411,1]
Token: Term=[the,0,0,412,1]
Token: Term=[typography,0,0,413,1]
Token: Term=[and,0,0,414,1]
Token: Term=[design,0,0,415,1]
Token: Term=[actually,0,0,416,1]
Token: Term=[<I>help</I>,0,0,417,1]
Token: Term=[the,0,0,418,1]
Token: Term=[r&eacute;sum&eacute;,0,0,419,1]
Token: Term=[do,0,0,420,1]
Token: Term=[its,0,0,421,1]
Token: Term=[job?,0,0,422,1]
Token: Term=[Can,0,0,423,1]
Token: Term=[you,0,0,424,1]
Token: Term=[list,0,0,425,1]
Token: Term=[ways,0,0,426,1]
Token: Term=[in,0,0,427,1]
Token: Term=[which,0,0,428,1]
Token: Term=[they,0,0,429,1]
Token: Term=[interfere,,0,0,430,1]
Token: Term=[and,0,0,431,1]
Token: Term=[suggest,0,0,432,1]
Token: Term=[improvements?</FONT></LI>,0,0,433,1]
Token: Term=[<BR>,0,0,434,1]
Token: Term=[</OL>,0,0,435,1]
Token: Term=[<LI>,0,0,436,1]
Token: Term=[<FONT,0,0,437,1]
Token: Term=[SIZE=4>Consider,0,0,438,1]
Token: Term=[the,0,0,439,1]
Token: Term=[cover,0,0,440,1]
Token: Term=[letter:</FONT></LI>,0,0,441,1]
Token: Term=[<BR>,0,0,442,1]
Token: Term=[<OL,0,0,443,1]
Token: Term=[TYPE="1">,0,0,444,1]
Token: Term=[<LI>,0,0,445,1]
Token: Term=[<FONT,0,0,446,1]
Token: Term=[SIZE=4>Does,0,0,447,1]
Token: Term=[the,0,0,448,1]
Token: Term=[letter,0,0,449,1]
Token: Term=[follow,0,0,450,1]
Token: Term=[an,0,0,451,1]
Token: Term=[appropriate,0,0,452,1]
Token: Term=[form?,0,0,453,1]
Token: Term=[Does,0,0,454,1]
Token: Term=[it,0,0,455,1]
Token: Term=[use,0,0,456,1]
Token: Term=[consistent,,0,0,457,1]
Token: Term=[parallel,0,0,458,1]
Token: Term=[language?,0,0,459,1]
Token: Term=[Is,0,0,460,1]
Token: Term=[the,0,0,461,1]
Token: Term=[tone,0,0,462,1]
Token: Term=[appropriate?</FONT></LI>,0,0,463,1]
Token: Term=[<BR>,0,0,464,1]
Token: Term=[<LI>,0,0,465,1]
Token: Term=[<FONT,0,0,466,1]
Token: Term=[SIZE=4>Does,0,0,467,1]
Token: Term=[the,0,0,468,1]
Token: Term=[author,0,0,469,1]
Token: Term=[highlight,0,0,470,1]
Token: Term=[the,0,0,471,1]
Token: Term=[qualifications,0,0,472,1]
Token: Term=[most,0,0,473,1]
Token: Term=[likely,0,0,474,1]
Token: Term=[to,0,0,475,1]
Token: Term=[get,0,0,476,1]
Token: Term=[him,0,0,477,1]
Token: Term=[or,0,0,478,1]
Token: Term=[her,0,0,479,1]
Token: Term=[an,0,0,480,1]
Token: Term=[interview,0,0,481,1]
Token: Term=[for,0,0,482,1]
Token: Term=[the,0,0,483,1]
Token: Term=[job?</FONT></LI>,0,0,484,1]
Token: Term=[<BR>,0,0,485,1]
Token: Term=[<LI>,0,0,486,1]
Token: Term=[<FONT,0,0,487,1]
Token: Term=[SIZE=4>If,0,0,488,1]
Token: Term=[something,0,0,489,1]
Token: Term=[in,0,0,490,1]
Token: Term=[the,0,0,491,1]
Token: Term=[author&#39;s,0,0,492,1]
Token: Term=[r&eacute;sum&eacute;,0,0,493,1]
Token: Term=[might,0,0,494,1]
Token: Term=[raise,0,0,495,1]
Token: Term=[serious,0,0,496,1]
Token: Term=[questions,0,0,497,1]
Token: Term=[with,0,0,498,1]
Token: Term=[a,0,0,499,1]
Token: Term=[potential,0,0,500,1]
Token: Term=[employer,,0,0,501,1]
Token: Term=[does,0,0,502,1]
Token: Term=[the,0,0,503,1]
Token: Term=[cover,0,0,504,1]
Token: Term=[letter,0,0,505,1]
Token: Term=[deal,0,0,506,1]
Token: Term=[with,0,0,507,1]
Token: Term=[them,0,0,508,1]
Token: Term=[adequately,0,0,509,1]
Token: Term=[without,0,0,510,1]
Token: Term=[drawing,0,0,511,1]
Token: Term=[undue,0,0,512,1]
Token: Term=[attention,0,0,513,1]
Token: Term=[to,0,0,514,1]
Token: Term=[them?</FONT></LI>,0,0,515,1]
Token: Term=[<BR>,0,0,516,1]
Token: Term=[</OL>,0,0,517,1]
Token: Term=[</OL>,0,0,518,1]
Token: Term=[<LI>,0,0,519,1]
Token: Term=[<FONT,0,0,520,1]
Token: Term=[SIZE=4>Review,0,0,521,1]
Token: Term=[your,0,0,522,1]
Token: Term=[comments,0,0,523,1]
Token: Term=[with,0,0,524,1]
Token: Term=[the,0,0,525,1]
Token: Term=[author,0,0,526,1]
Token: Term=[(and,0,0,527,1]
Token: Term=[vice,0,0,528,1]
Token: Term=[versa).,0,0,529,1]
Token: Term=[Be,0,0,530,1]
Token: Term=[sure,0,0,531,1]
Token: Term=[to,0,0,532,1]
Token: Term=[write,0,0,533,1]
Token: Term=[&quot;Edited,0,0,534,1]
Token: Term=[by&quot;,0,0,535,1]
Token: Term=[and,0,0,536,1]
Token: Term=[your,0,0,537,1]
Token: Term=[name,0,0,538,1]
Token: Term=[on,0,0,539,1]
Token: Term=[the,0,0,540,1]
Token: Term=[copy,0,0,541,1]
Token: Term=[you,0,0,542,1]
Token: Term=[edited.,0,0,543,1]
Token: Term=[Give,0,0,544,1]
Token: Term=[your,0,0,545,1]
Token: Term=[comments,0,0,546,1]
Token: Term=[to,0,0,547,1]
Token: Term=[the,0,0,548,1]
Token: Term=[author,,0,0,549,1]
Token: Term=[who,0,0,550,1]
Token: Term=[must,0,0,551,1]
Token: Term=[include,0,0,552,1]
Token: Term=[them,0,0,553,1]
Token: Term=[with,0,0,554,1]
Token: Term=[his,0,0,555,1]
Token: Term=[or,0,0,556,1]
Token: Term=[her,0,0,557,1]
Token: Term=[turned-in,0,0,558,1]
Token: Term=[version.,0,0,559,1]
Token: Term=[Be,0,0,560,1]
Token: Term=[sure,0,0,561,1]
Token: Term=[you,0,0,562,1]
Token: Term=[get,0,0,563,1]
Token: Term=[comments,0,0,564,1]
Token: Term=[from,0,0,565,1]
Token: Term=[your,0,0,566,1]
Token: Term=[editors,,0,0,567,1]
Token: Term=[too.</FONT></LI>,0,0,568,1]
Token: Term=[</OL>,0,0,569,1]
Token: Term=[<BR,0,0,570,1]
Token: Term=[CLEAR=ALL>,0,0,571,1]
Token: Term=[</BODY>,0,0,572,1]
Token: Term=[</HTML>,0,0,573,1]
Token: Term=[,0,0,574,1]
Processing new document:
Token: Term=[<?xml,0,0,0,1]
Token: Term=[version="1.0",0,0,1,1]
Token: Term=[encoding="UTF-8"?>,0,0,2,1]
Token: Term=[<!DOCTYPE,0,0,3,1]
Token: Term=[html,0,0,4,1]
Token: Term=[PUBLIC,0,0,5,1]
Token: Term=["-//W3C//DTD,0,0,6,1]
Token: Term=[XHTML,0,0,7,1]
Token: Term=[1.0,0,0,8,1]
Token: Term=[Strict//EN",0,0,9,1]
Token: Term=["http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">,0,0,10,1]
Token: Term=[<html>,0,0,11,1]
Token: Term=[<head>,0,0,12,1]
Token: Term=[<title>Sanguthevar,0,0,13,1]
Token: Term=[Rajasekaran</title>,0,0,14,1]
Token: Term=[<link,0,0,15,1]
Token: Term=[rel="shortcut,0,0,16,1]
Token: Term=[icon",0,0,17,1]
Token: Term=[href="/fano.png",0,0,18,1]
Token: Term=[type="image/png",0,0,19,1]
Token: Term=[/>,0,0,20,1]
Token: Term=[<link,0,0,21,1]
Token: Term=[rel="stylesheet",0,0,22,1]
Token: Term=[href="/stylesheet.css",0,0,23,1]
Token: Term=[type="text/css",0,0,24,1]
Token: Term=[/>,0,0,25,1]
Token: Term=[</head>,0,0,26,1]
Token: Term=[<body>,0,0,27,1]
Token: Term=[<div,0,0,28,1]
Token: Term=[class="outer">,0,0,29,1]
Token: Term=[<h1>Sanguthevar,0,0,30,1]
Token: Term=[Rajasekaran</h1>,0,0,31,1]
Token: Term=[<div,0,0,32,1]
Token: Term=[class="inner">,0,0,33,1]
Token: Term=[<p><a,0,0,34,1]
Token: Term=[href="../Organization/Univ-of-Florida-Dept-Computer-+-Information-Sci-+-Engr.html">Univ.,0,0,35,1]
Token: Term=[of,0,0,36,1]
Token: Term=[Florida,,0,0,37,1]
Token: Term=[Dept.,0,0,38,1]
Token: Term=[Computer,0,0,39,1]
Token: Term=[&amp;,0,0,40,1]
Token: Term=[Information,0,0,41,1]
Token: Term=[Sci.,0,0,42,1]
Token: Term=[&amp;,0,0,43,1]
Token: Term=[Engr.</a><br,0,0,44,1]
Token: Term=[/>,0,0,45,1]
Token: Term=[<a,0,0,46,1]
Token: Term=[href="http://www.cise.ufl.edu/~raj/">http://www.cise.ufl.edu/~raj/</a><br,0,0,47,1]
Token: Term=[/>,0,0,48,1]
Token: Term=[<a,0,0,49,1]
Token: Term=[href="&#109;&#97;&#105;&#108;&#116;&#111;&#58;&#114;&#97;&#106;&#64;&#99;&#105;&#115;&#101;&#46;&#117;&#102;&#108;&#46;&#101;&#100;&#117;">&#114;&#97;&#106;&#64;&#99;&#105;&#115;&#101;&#46;&#117;&#102;&#108;&#46;&#101;&#100;&#117;</a></p>,0,0,50,1]
Token: Term=[<p>Author,,0,0,51,1]
Token: Term=[editor,,0,0,52,1]
Token: Term=[or,0,0,53,1]
Token: Term=[reviewer,0,0,54,1]
Token: Term=[of:</p>,0,0,55,1]
Token: Term=[<ul>,0,0,56,1]
Token: Term=[<li><a,0,0,57,1]
Token: Term=[href="../Document/Computer-Algorithms.html">Computer,0,0,58,1]
Token: Term=[Algorithms</a></li>,0,0,59,1]
Token: Term=[<li><a,0,0,60,1]
Token: Term=[href="../Document/Computer-AlgorithmsC++.html">Computer,0,0,61,1]
Token: Term=[Algorithms/C++</a></li>,0,0,62,1]
Token: Term=[</ul>,0,0,63,1]
Token: Term=[<div,0,0,64,1]
Token: Term=[class="navbar">,0,0,65,1]
Token: Term=[[<a,0,0,66,1]
Token: Term=[href="http://www.ics.uci.edu/~eppstein/pubs/">D.,0,0,67,1]
Token: Term=[Eppstein,0,0,68,1]
Token: Term=[publications</a>],0,0,69,1]
Token: Term=[[<a,0,0,70,1]
Token: Term=[href="/cites/">Citation,0,0,71,1]
Token: Term=[database</a>],0,0,72,1]
Token: Term=[[<a,0,0,73,1]
Token: Term=[href="/cites/Author/">Authors</a>],0,0,74,1]
Token: Term=[</div>,0,0,75,1]
Token: Term=[</div>,0,0,76,1]
Token: Term=[<div,0,0,77,1]
Token: Term=[class="credit">,0,0,78,1]
Token: Term=[<a,0,0,79,1]
Token: Term=[href="/">Fano</a>,0,0,80,1]
Token: Term=[Experimental,0,0,81,1]
Token: Term=[Web,0,0,82,1]
Token: Term=[Server,,0,0,83,1]
Token: Term=[<a,0,0,84,1]
Token: Term=[href="http://www.ics.uci.edu/~eppstein/">D.,0,0,85,1]
Token: Term=[Eppstein</a>,,0,0,86,1]
Token: Term=[<a,0,0,87,1]
Token: Term=[href="http://www.ics.uci.edu/">School,0,0,88,1]
Token: Term=[of,0,0,89,1]
Token: Term=[Information,0,0,90,1]
Token: Term=[&amp;,0,0,91,1]
Token: Term=[Computer,0,0,92,1]
Token: Term=[Science</a>,,0,0,93,1]
Token: Term=[<a,0,0,94,1]
Token: Term=[href="http://www.uci.edu/">UC,0,0,95,1]
Token: Term=[Irvine</a>,0,0,96,1]
Token: Term=[</div>,0,0,97,1]
Token: Term=[<a,0,0,98,1]
Token: Term=[href="http://store.apple.com/"><img,0,0,99,1]
Token: Term=[alt="Made,0,0,100,1]
Token: Term=[on,0,0,101,1]
Token: Term=[a,0,0,102,1]
Token: Term=[Mac",0,0,103,1]
Token: Term=[height="31",0,0,104,1]
Token: Term=[width="88",0,0,105,1]
Token: Term=[src="/mac.png",0,0,106,1]
Token: Term=[/></a>,0,0,107,1]
Token: Term=[<a,0,0,108,1]
Token: Term=[href="http://validator.w3.org/check/referer"><img,0,0,109,1]
Token: Term=[alt="Valid,0,0,110,1]
Token: Term=[XHTML,0,0,111,1]
Token: Term=[1.0!",0,0,112,1]
Token: Term=[height="31",0,0,113,1]
Token: Term=[width="88",0,0,114,1]
Token: Term=[src="/validx.png",0,0,115,1]
Token: Term=[/></a>,0,0,116,1]
Token: Term=[</div>,0,0,117,1]
Token: Term=[</body>,0,0,118,1]
Token: Term=[</html>,0,0,119,1]
Token: Term=[,0,0,120,1]
PASS   : UnitTest::testSpidy()
PASS   : UnitTest::cleanupTestCase()
Totals: 3 passed, 0 failed, 0 skipped, 0 blacklisted
********* Finished testing of UnitTest *********

Copied from original issue: DaviesX/e8yesearch#1

Metadata

Metadata

Type

No type

Projects

No projects

Relationships

None yet

Development

No branches or pull requests

Issue actions