Skip to content

investigate expected cost of cloud deployment, and as well as possible approaches for measuring cost #1425

@jmartin-sul

Description

@jmartin-sul

from sul-dlss/speech-to-text#3

see if back of the envelope estimates could rule out some of our possible implementations, or if they all look close enough in cost to keep pursuing all paths.

this involves two orthogonal sets of choices: how to deploy, and how to measure cost of deployed services.

deployment choices (not all mutually exclusive, but some are):

  • ECS, possibly with some rules around scaling, unclear whether we can scale from 0 to 1 or more GPU EC2 instances in the ECS cluster. may need to have at least one EC2 instance always up, but it might be ok for it to be non-GPU? is it possible to have heterogeneous mix of VM types in one ECS cluster?
  • always on EC2 VM
  • thin vs heavy whisper docker container (i.e. is the container built such that it contains the models it needs, or does it download them on startup? whisper models total out to about 13 GB, says @edsu)
  • SageMaker vs docker container running whisper (concerns about SageMaker configurability and training data policy captured in [investigate/prototype] speech_to_text_generation_service approach 2: Explore AWS SageMaker speech-to-text#4)

cost estimate approach:

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions