• utopiah@lemmy.ml
    link
    fedilink
    arrow-up
    2
    ·
    2 days ago

    Arguable… it’s OKish at best, definitely nowhere near as good as professional… then IMHO it’s like spotting a spelling mistake in an official document, you instantly look for MORE mistakes then it become distracting. There is something powerful about trust that once it’s broken, it’s hard to get back. Once a spelling or here transcription mistake happens, then we brace for more (rationally so) and it becomes a very taxing endeavor.

    So… sure STT progressed quite a bit but it’s STILL not good enough in a lot of cases.

    Case in point, IMHO when there is a choice, most people (everybody?) would rather have human made captions than AI ones.

    • RiQuY@lemm.ee
      link
      fedilink
      arrow-up
      3
      ·
      2 days ago

      So for example if I want to add closed captions to my Twitch streams there is no way that I go pay a person to transcribe/translate the shit I’m saying for 3 hours everyday. Maybe this is a weird example but it fits my needs.

      • utopiah@lemmy.ml
        link
        fedilink
        arrow-up
        2
        ·
        2 days ago

        Not a weird example. I have my self hosted video server (PeerTube) and I tinkered with transcription thanks to whisper.cpp locally. It “works” in the sense that most of it is acceptable. It still does mistake though. I provide all my content, including hosting, at my costs and to anyone in the World for free.

        So… I definitely see the value. I’m only saying that it has downsides and quality-wise relative to professional, it’s still bad.

      • utopiah@lemmy.ml
        link
        fedilink
        arrow-up
        3
        ·
        2 days ago

        Indeed, professionals are expensive, and IMHO rightfully so. I’m only trying to highlight the fact that it’s unfair to imply it’s much cheaper when the quality isn’t on par.