Skip to content

Word-precision timecodes for audio w/o lyrics in online databases, optionally providing manual transcript #6

@porg

Description

@porg

Are the following use cases supported?

Goal / Desired output: Lyrics or subtitle file with word-precision timecodes

  • Application for lyrics file: Karaoke, foreign language learning, etc.
  • Application for video subtitles: Foreign language learning, search and jump to specific word in video, etc.

Starting point(s)

  1. Unpublished audio file for which there yet exists no public lyrics/transcripts.
    • a) Voice only as an isolated audio track.
    • b) Voice on top of instruments, no separation.
  2. Optional: Provide a manually created transcript file (simple plain text, line by line, no timecodes) to aid the processing with reliable cues.
    • Applications:
      • Less standardized language such as a dialect.
      • Or voices which for artistic purposes have an extraordinary pronunciation or tone naturally by the singer (think singers in Heavy Metal, Jazz, stylizations like Vibrato or Jodeling or voices like Björk) or due to heavy effects such as vocoder, echo, distortion, etc.
    • Added value for lyrics file creator: No need to create the timecodes by hand.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions