Forbes articles don't work. #2

fergyfresh · 2015-09-27T13:44:03Z

I was using BeautifulSoup to scrape through a Forbes article to pull out the text until I realized Alchemy has an API to do this. I was running into an annoying scenario where the landing page for any forbes article is the forbes/home page with a 'Continue 3..2..1' displayed on the page. I was able to work around that, but it seems that your API doesn't. Can I feed the raw html from BeautifulSoup to an API call on your end? This would be my quickest workaround. I will be reading the API docs but currently this is what happens.

URL: http://www.forbes.com/sites/laurashin/2015/09/09/bitcoins-shared-ledger-technology-moneys-new-operating-system/
What I am expecting is the text on the response object to have the article text, but it has the continue text.

from alchemyapi import AlchemyAPI
import json

# Create the AlchemyAPI Object
alchemyapi = AlchemyAPI()

# Create demo url which will be user input later
demo_url = 'http://www.forbes.com/sites/laurashin/2015/09/09/bitcoins-shared-ledger-technology-moneys-new-operating-system/'

# Created response object from input url
response = alchemyapi.text('url', demo_url)

if response['status'] == 'OK':
    print('## Response Object ##')
    print(json.dumps(response, indent=4))

    print('')
    print('## Text ##')
    print('text: ', response['text'].encode('utf-8'))
    print('')
else:
    print('Error in text extraction call: ', response['statusInfo'])

steveherschleb · 2015-09-28T18:00:14Z

Hi Billy, I'm no longer the maintainer of this SDK and I should probably
remove this repo. You're questions are probably better answered by the team
at AlchemyAPI, here's their profile: https://github.com/alchemyapi

On Sun, Sep 27, 2015 at 7:44 AM, Billy Ferguson notifications@github.com
wrote:

I was using BeautifulSoup to scrape through a Forbes article to pull out
the text until I realized Alchemy has an API to do this. I was running into
an annoying scenario where the landing page for any forbes article is the
forbes/home page with a 'Continue 3..2..1' displayed on the page. I was
able to work around that, but it seems that your API doesn't. Can I feed
the raw html from BeautifulSoup to an API call on your end? This would be
my quickest workaround. I will be reading the API docs but currently this
is what happens.

URL:
http://www.forbes.com/sites/laurashin/2015/09/09/bitcoins-shared-ledger-technology-moneys-new-operating-system/

What I am expecting is the text on the response object to have the article
text, but it has the continue text.

Created response object from input url

response = alchemyapi.text('url', demo_url)

—
Reply to this email directly or view it on GitHub
#2.

fergyfresh · 2015-09-28T18:01:48Z

I couldn't log issues there. Thanks though.

On Mon, Sep 28, 2015, 2:00 PM Steve Herschleb notifications@github.com
wrote:

Hi Billy, I'm no longer the maintainer of this SDK and I should probably
remove this repo. You're questions are probably better answered by the team
at AlchemyAPI, here's their profile: https://github.com/alchemyapi

On Sun, Sep 27, 2015 at 7:44 AM, Billy Ferguson notifications@github.com
wrote:

I was using BeautifulSoup to scrape through a Forbes article to pull out
the text until I realized Alchemy has an API to do this. I was running
into
an annoying scenario where the landing page for any forbes article is the
forbes/home page with a 'Continue 3..2..1' displayed on the page. I was
able to work around that, but it seems that your API doesn't. Can I feed
the raw html from BeautifulSoup to an API call on your end? This would be
my quickest workaround. I will be reading the API docs but currently this
is what happens.

URL:

http://www.forbes.com/sites/laurashin/2015/09/09/bitcoins-shared-ledger-technology-moneys-new-operating-system/

What I am expecting is the text on the response object to have the
article
text, but it has the continue text.

Created response object from input url

response = alchemyapi.text('url', demo_url)

—
Reply to this email directly or view it on GitHub
#2.

—
Reply to this email directly or view it on GitHub
#2 (comment)
.

steveherschleb · 2015-09-28T18:04:15Z

Maybe this would help: http://www.alchemyapi.com/products/contact-support

Sorry I haven't worked there in a few years, so I don't really have any
extra info for you.

On Mon, Sep 28, 2015 at 12:01 PM, Billy Ferguson notifications@github.com
wrote:

I couldn't log issues there. Thanks though.

On Mon, Sep 28, 2015, 2:00 PM Steve Herschleb notifications@github.com
wrote:

Hi Billy, I'm no longer the maintainer of this SDK and I should probably
remove this repo. You're questions are probably better answered by the
team
at AlchemyAPI, here's their profile: https://github.com/alchemyapi

On Sun, Sep 27, 2015 at 7:44 AM, Billy Ferguson <
notifications@github.com>
wrote:

I was using BeautifulSoup to scrape through a Forbes article to pull
out
the text until I realized Alchemy has an API to do this. I was running
into
an annoying scenario where the landing page for any forbes article is
the
forbes/home page with a 'Continue 3..2..1' displayed on the page. I was
able to work around that, but it seems that your API doesn't. Can I
feed
the raw html from BeautifulSoup to an API call on your end? This would
be
my quickest workaround. I will be reading the API docs but currently
this
is what happens.

URL:

http://www.forbes.com/sites/laurashin/2015/09/09/bitcoins-shared-ledger-technology-moneys-new-operating-system/

What I am expecting is the text on the response object to have the
article
text, but it has the continue text.

Created response object from input url

response = alchemyapi.text('url', demo_url)

—
Reply to this email directly or view it on GitHub
#2.

—
Reply to this email directly or view it on GitHub
<
#2 (comment)

.

—
Reply to this email directly or view it on GitHub
#2 (comment)
.

fergyfresh · 2015-09-28T18:08:29Z

Thanks man, I emailed the support and just chased the repo to your repo.
On Sep 28, 2015 2:04 PM, "Steve Herschleb" notifications@github.com wrote:

Maybe this would help: http://www.alchemyapi.com/products/contact-support

Sorry I haven't worked there in a few years, so I don't really have any
extra info for you.

On Mon, Sep 28, 2015 at 12:01 PM, Billy Ferguson <notifications@github.com

wrote:

I couldn't log issues there. Thanks though.

On Mon, Sep 28, 2015, 2:00 PM Steve Herschleb notifications@github.com
wrote:

Hi Billy, I'm no longer the maintainer of this SDK and I should
probably
remove this repo. You're questions are probably better answered by the
team
at AlchemyAPI, here's their profile: https://github.com/alchemyapi

On Sun, Sep 27, 2015 at 7:44 AM, Billy Ferguson <
notifications@github.com>
wrote:

I was using BeautifulSoup to scrape through a Forbes article to pull
out
the text until I realized Alchemy has an API to do this. I was
running
into
an annoying scenario where the landing page for any forbes article is
the
forbes/home page with a 'Continue 3..2..1' displayed on the page. I
was
able to work around that, but it seems that your API doesn't. Can I
feed
the raw html from BeautifulSoup to an API call on your end? This
would
be
my quickest workaround. I will be reading the API docs but currently
this
is what happens.

URL:

http://www.forbes.com/sites/laurashin/2015/09/09/bitcoins-shared-ledger-technology-moneys-new-operating-system/

What I am expecting is the text on the response object to have the
article
text, but it has the continue text.

Created response object from input url

response = alchemyapi.text('url', demo_url)

—
Reply to this email directly or view it on GitHub
#2.

—
Reply to this email directly or view it on GitHub
<

#2 (comment)

.

—
Reply to this email directly or view it on GitHub
<
#2 (comment)

.

—
Reply to this email directly or view it on GitHub
#2 (comment)
.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Forbes articles don't work. #2

Forbes articles don't work. #2

fergyfresh commented Sep 27, 2015

steveherschleb commented Sep 28, 2015

Created response object from input url

Uh oh!

fergyfresh commented Sep 28, 2015

Created response object from input url

Uh oh!

steveherschleb commented Sep 28, 2015

Created response object from input url

Uh oh!

fergyfresh commented Sep 28, 2015

Created response object from input url

Uh oh!

Forbes articles don't work. #2

Forbes articles don't work. #2

Comments

fergyfresh commented Sep 27, 2015

steveherschleb commented Sep 28, 2015

Created response object from input url

Uh oh!

fergyfresh commented Sep 28, 2015

Created response object from input url

Uh oh!

steveherschleb commented Sep 28, 2015

Created response object from input url

Uh oh!

fergyfresh commented Sep 28, 2015

Created response object from input url

Uh oh!