majority of bold records are being filtered #53

@johrstrom

While working on #52, I got this report. Apparently about 82% of BOLD records (479287 of 585125) are being marked as invalid.

- input_records: 585125
  invalid_records: 479287
  invalid_taxons: 494
  invalid_occurences: 0
  output_files: 105344
  name: add_bold_records
  time: '2022-02-23T16:59:10-05:00'

They're getting filtered here.

# TODO: every skipped record should be written to a file
unless record.gene_symbol_mapped.present? && record.sequence.present? && record.species.present? && record.species_binomial?
  invalid_records += 1
  next
end
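
To see which of the four checks is responsible for most of the rejections, something like the following could be used to count failures per check. This is only a sketch: `Record`, `filled?`, `rejection_reasons` and `tally_rejections` are hypothetical names, `Record` is a stand-in for the real record class, and `filled?` is a plain-Ruby approximation of ActiveSupport's `present?` so the snippet runs on its own.

```ruby
# Hypothetical stand-in for the real record class; only the four fields
# used by the filter are modelled here.
Record = Struct.new(:gene_symbol_mapped, :sequence, :species, :binomial, keyword_init: true) do
  def species_binomial?
    binomial
  end
end

# Plain-Ruby approximation of ActiveSupport's `present?`:
# nil, empty, and whitespace-only values count as missing.
def filled?(value)
  return false if value.nil?
  return false if value.is_a?(String) && value.strip.empty?
  return false if value.respond_to?(:empty?) && value.empty?

  true
end

# Returns the names of the checks a record fails, in the same order
# as the filter condition.
def rejection_reasons(record)
  reasons = []
  reasons << :gene_symbol_mapped unless filled?(record.gene_symbol_mapped)
  reasons << :sequence unless filled?(record.sequence)
  reasons << :species unless filled?(record.species)
  reasons << :species_binomial unless record.species_binomial?
  reasons
end

# Counts how many records fail each check.
def tally_rejections(records)
  counts = Hash.new(0)
  records.each do |record|
    rejection_reasons(record).each { |reason| counts[reason] += 1 }
  end
  counts
end
```

A record can fail more than one check, so the per-check counts may add up to more than `invalid_records`. The same `rejection_reasons` result could also be written out alongside each skipped record, which would cover the TODO above.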
