Forbes articles don't work. #2

fergyfresh opened this issue Sep 27, 2015 · 4 comments
@fergyfresh

I was using BeautifulSoup to scrape a Forbes article and pull out its text until I realized Alchemy has an API to do this. I ran into an annoying scenario: the landing page for any Forbes article is the forbes/home page with a 'Continue 3..2..1' displayed on the page. I was able to work around that, but it seems that your API doesn't. Can I feed the raw HTML from BeautifulSoup to an API call on your end? That would be my quickest workaround. I will be reading the API docs, but currently this is what happens:

from alchemyapi import AlchemyAPI
import json

# Create the AlchemyAPI Object
alchemyapi = AlchemyAPI()

# Create demo url which will be user input later
demo_url = 'http://www.forbes.com/sites/laurashin/2015/09/09/bitcoins-shared-ledger-technology-moneys-new-operating-system/'

# Create the response object from the input URL
response = alchemyapi.text('url', demo_url)

if response['status'] == 'OK':
    print('## Response Object ##')
    print(json.dumps(response, indent=4))

    print('')
    print('## Text ##')
    print('text: ', response['text'].encode('utf-8'))
    print('')
else:
    print('Error in text extraction call: ', response['statusInfo'])
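
For anyone with the same question, here is a minimal sketch of the "feed it raw HTML" idea. It assumes the SDK's `text` call accepts an `'html'` flavor alongside `'url'`, which should be confirmed against the API docs. The helper name `extract_text_from_html` and its `client` parameter are hypothetical and are not part of the SDK.

```python
def extract_text_from_html(client, html):
    """Ask a text-extraction client to process HTML that was already fetched.

    `client` is any object with a text(flavor, data) method that returns a
    dict shaped like the responses in the post above ('status', 'text',
    'statusInfo'). Passing 'html' as the flavor is an assumption here.
    """
    response = client.text('html', html)

    # Fail loudly instead of printing, so the caller can decide what to do.
    if response.get('status') != 'OK':
        raise RuntimeError(
            'Error in text extraction call: '
            + str(response.get('statusInfo', 'unknown error'))
        )

    return response['text']
```

With the object from the snippet above, the call would look like `extract_text_from_html(alchemyapi, html)`, where `html` is the article markup obtained after getting past the 'Continue' page. Taking the client as a parameter keeps the helper testable without an API key.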
@steveherschleb
Owner

Hi Billy, I'm no longer the maintainer of this SDK and I should probably
remove this repo. Your questions are probably better answered by the team
at AlchemyAPI. Here's their profile: https://github.com/alchemyapi

On Sun, Sep 27, 2015 at 7:44 AM, Billy Ferguson notifications@github.com
wrote:

URL:
http://www.forbes.com/sites/laurashin/2015/09/09/bitcoins-shared-ledger-technology-moneys-new-operating-system/

What I am expecting is the text on the response object to have the article
text, but it has the continue text.

@fergyfresh
Author

I couldn't log issues there. Thanks though.

@steveherschleb
Owner

Maybe this would help: http://www.alchemyapi.com/products/contact-support

Sorry, I haven't worked there in a few years, so I don't really have any
extra info for you.

@fergyfresh
Author

Thanks man, I emailed support. I had just followed the links and ended up at your repo.