tm.plugin.webmining

tm.plugin.webmining is an R package that facilitates text retrieval from feed formats such as XML (RSS, Atom) and JSON. Direct retrieval from HTML is also supported. Because most (news) feeds contain only a small fraction of the original text, tm.plugin.webmining also extracts the full text from the original source.

Install

To install the latest version from CRAN, run:

install.packages("tm.plugin.webmining")

Using the devtools package, you can install the latest development version of tm.plugin.webmining from GitHub with:

library(devtools)
install_github("mannau/tm.plugin.webmining")

Windows users need to use the following command to install from GitHub:

library(devtools)
install_github("mannau/boilerpipeR", args = "--no-multiarch")
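
After either installation route, it can be useful to confirm that R can actually find the package. The following is a minimal sketch using only base R functions; the variable names `pkg` and `installed` are illustrative and not part of the package.

```r
# Check whether the package can be loaded, without attaching it.
pkg <- "tm.plugin.webmining"
installed <- requireNamespace(pkg, quietly = TRUE)

if (installed) {
  # Report the installed version.
  message(pkg, " version ", as.character(utils::packageVersion(pkg)), " is installed")
} else {
  message(pkg, " is not installed; see the install commands above")
}
```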

Usage

The following snippet shows how to download and extract the main text from all supported sources as WebCorpus objects, each including a rich set of metadata such as Author, DateTimeStamp, or Source:

library(tm.plugin.webmining)
googlefinance <- WebCorpus(GoogleFinanceSource("NASDAQ:MSFT"))
googlenews <- WebCorpus(GoogleNewsSource("Microsoft"))
nytimes <- WebCorpus(NYTimesSource("Microsoft", appid = "<nytimes_appid>"))
reutersnews <- WebCorpus(ReutersNewsSource("businessNews"))
#twitter <- WebCorpus(TwitterSource("Microsoft")) -> not supported yet
yahoofinance <- WebCorpus(YahooFinanceSource("MSFT"))
yahooinplay <- WebCorpus(YahooInplaySource())
yahoonews <- WebCorpus(YahooNewsSource("Microsoft"))
liberation <- WebCorpus(LiberationSource("latest"))
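
Each of the calls above depends on a remote feed, so any one of them may fail if the feed is unreachable or has changed. As a rough sketch, a retrieval call can be wrapped so that a failure yields NULL instead of stopping the script. The helper `safe_corpus` is hypothetical and not part of tm.plugin.webmining, and the use of `NLP::meta` assumes the NLP package (a dependency of tm) is available alongside it.

```r
# Hypothetical helper (not part of tm.plugin.webmining): evaluate a retrieval
# expression and return NULL instead of stopping if it raises an error.
safe_corpus <- function(expr) {
  tryCatch(
    expr,  # the argument is evaluated lazily, so errors are caught here
    error = function(e) {
      message("Retrieval failed: ", conditionMessage(e))
      NULL
    }
  )
}

# Only attempt a retrieval if the package is installed.
if (requireNamespace("tm.plugin.webmining", quietly = TRUE)) {
  library(tm.plugin.webmining)
  yahoonews <- safe_corpus(WebCorpus(YahooNewsSource("Microsoft")))
  if (!is.null(yahoonews) && length(yahoonews) > 0) {
    print(length(yahoonews))          # number of retrieved documents
    print(NLP::meta(yahoonews[[1]]))  # metadata of the first document
  }
}
```

The same wrapper can be applied to any of the other sources, which makes it possible to build several corpora in one script and skip the ones that are unavailable.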

License

tm.plugin.webmining is released under the GNU General Public License Version 3.
