• brucethemoose@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      8 hours ago

      Yeah, Mistral is just… not very good anymore.

      Even strictly compared to open weights models. Everything I’ve tried feels obsolete.

    • placebo@lemmy.zip
      link
      fedilink
      English
      arrow-up
      5
      arrow-down
      2
      ·
      14 hours ago

      I don’t know what models power Lumo, but Mistral is so far behind the competition it’s not even funny.

      • Jiral@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        arrow-down
        2
        ·
        14 hours ago

        Mistral Medium 3.5 isn’t that far behind comparable current open weight models.

        • placebo@lemmy.zip
          link
          fedilink
          English
          arrow-up
          1
          ·
          14 hours ago

          Maybe, but Mistral is a commercial company that offers commercial products that can’t really compete with OpenAI, Anthropic, and the others. That’s what I meant.

          • TorstenTyp@feddit.nu
            link
            fedilink
            English
            arrow-up
            2
            ·
            14 hours ago

            I’m curious as to why you say that? I use Le Chat and to me it feels exactly like ChatGPT or Claude, it can code well, translate, search online, everything.

            • gerryflap@feddit.nl
              link
              fedilink
              English
              arrow-up
              1
              ·
              10 hours ago

              Personally I really struggle with it. I wanna use it because it’s not from a US company, but it’s just wrong in like 60% of whatever I ask it. Sometimes even when corrected. It’s good though, makes me stop using the clanker and actually search for myself. Sometimes I do resort to ChatGPT or accidentally invoke Gemini when searching though, and those two have a way higher hit rate

              • TorstenTyp@feddit.nu
                link
                fedilink
                English
                arrow-up
                3
                ·
                8 hours ago

                My two cents on the matter is that LLMs are incredibly useful, but I would never use it to that end. I never ask about facts or use it as a search engine. Sure, use it as a springboard if you don’t know where to start or what to search for, or maybe to find the right link to a website. But directly relying on it for information is risky, whatever the model. Maybe this is the reason our experiences seem to differ.

            • placebo@lemmy.zip
              link
              fedilink
              English
              arrow-up
              2
              ·
              14 hours ago

              Subjectively, it feels similar to models we used a year or two ago. Not that drastically different from what Anthropic and OpenAI offer today, but slightly worse. For instance, for complex coding tasks it offers basic solutions, while Claude often offers more options and details - as if it knows more.

              Objectively, benchmarks. Mistral looks comparable to other open weight models (as another user mentioned), but not as good otherwise.

              • onnekas@sopuli.xyz
                link
                fedilink
                English
                arrow-up
                2
                ·
                edit-2
                11 hours ago

                With the results of chat out of the box I kind of agree with you that mistral feels behind.

                However, there are some features that I really like and make the experience even better than chatgpt.

                1. Agents are pretty cool and with some setup produce very good results
                2. managing libraries for documents/context is better than in ChatGPT. Also adding specific libraries to agents is nice.
                3. Scheduling tasks has just been added and I want to try that out.

                (I have never tried the paid version of any LLM chat so I can only compare free tiers)

              • TorstenTyp@feddit.nu
                link
                fedilink
                English
                arrow-up
                2
                ·
                13 hours ago

                I see, that’s about the time they all got so good that I stopped trying to keep up with the latest benchmarks. It works perfectly for my needs so I definitely wouldn’t dismiss it for anyone wanting to switch to a European alternative.

                • placebo@lemmy.zip
                  link
                  fedilink
                  English
                  arrow-up
                  2
                  ·
                  12 hours ago

                  Sure, totally depends on your needs. But it’d be great if we had one of them frontier models in Europe.

                  • Jiral@lemmy.world
                    link
                    fedilink
                    English
                    arrow-up
                    1
                    ·
                    edit-2
                    11 hours ago

                    Mistral has recently shown a good trajectory of improvement. It is already an important thing that there is a European mid range open weight model that can compete. (Frontier models need a lot more resources, it is important to compare apples with apples) This is good enough for many applications were data security and sovereignity are prime concerns. Of course, it would be good to have a frontier model, lets see how Large 4 will perform when we get there.