Skip to content

extract PubChem section's content #385

@francoiskroll

Description

@francoiskroll

Thanks for an amazing package! Incredibly helpful.

I tagged this as "bug" but it might be a "Database suggestion", depends on the answer...

As example, let's take the PubChem page for aspirin (CID=2244), section "Associated Disorders and Diseases": https://pubchem.ncbi.nlm.nih.gov/compound/2244#section=Associated-Disorders-and-Diseases

Is there any way I can extract 'useful' data from this sort of sections? Namely the list of diseases here.

I tried

pc_sect(id='2244',
        section='Associated Disorders and Diseases',
        domain='compound')

It does run, but it returns:

# A tibble: 3 × 5
  CID   Name    Result               SourceName                                SourceID         
  <chr> <chr>   <chr>                <chr>                                     <chr>            
1 2244  Aspirin ctd_chemical_disease Comparative Toxicogenomics Database (CTD) D001241::Compound
2 2244  Aspirin collection=ttd_dd    Therapeutic Target Database (TTD)         D07DPI           
3 2244  Aspirin collection=ttd_dd    Therapeutic Target Database (TTD)         D0GY5Z  

which is not really I am interested in...

Am I missing something?

Thanks!

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or enhancement

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions