{"id":8495,"date":"2026-01-08T09:23:17","date_gmt":"2026-01-08T09:23:17","guid":{"rendered":"https:\/\/www.hostinger.com\/blog\/?p=8495"},"modified":"2026-01-08T09:23:19","modified_gmt":"2026-01-08T09:23:19","slug":"balancing-horizons-llms","status":"publish","type":"post","link":"https:\/\/www.hostinger.com\/blog\/balancing-horizons-llms\/","title":{"rendered":"LLMs under the hood of Hostinger Horizons: Balancing performance, speed, and cost"},"content":{"rendered":"<p>The <strong>large language model (LLM)<\/strong> race is accelerating, with new architectures, fine-tunes, and specialized systems arriving before the last ones have even settled. With such intense dynamics, selecting the right model takes intention, speed, and constant re-evaluation.<\/p><p>Rather than committing to a single provider or architecture, we systematically benchmark models across a wide range of real-world tasks and domain-specific scenarios. By continuously integrating and testing the latest LLMs, we ensure that <a href=\"https:\/\/www.hostinger.com\/horizons\" target=\"_blank\" rel=\"noreferrer noopener\">Hostinger Horizons<\/a>, your <strong>all-in-one, no-code AI partner<\/strong>, is always powered by top tech to deliver the strongest <strong>performance<\/strong>, <strong>reliability<\/strong>, and <strong>value<\/strong>. Here&rsquo;s what our latest assessments and experiences reveal.<\/p><h2 class=\"wp-block-heading\" id=\"h-who-leads-the-race\">Who leads the race?<\/h2><p>Dozens of major LLMs currently compete on the market &ndash; each with its own strengths and weaknesses &ndash; so we always combine several of them and keep up with the latest developments and releases. One example was the launch of <strong>Gemini 3<\/strong> by <strong>Google<\/strong> in mid-November 2025. 
It generated <a href=\"https:\/\/www.zdnet.com\/article\/want-to-ditch-chatgpt-gemini-3-shows-early-signs-of-winning-the-ai-race\/\" target=\"_blank\" rel=\"noreferrer noopener\">quite a buzz<\/a>, and our internal research confirmed that <strong>Gemini 3<\/strong> is indeed worth the hype.<\/p><p>Today, <strong>Gemini 3<\/strong> powers parts of <strong>Hostinger Horizons<\/strong>, delivering more precise, higher-quality code than <strong>Gemini 2.5<\/strong>. It also fixes errors more reliably, with our autofix success rate jumping from <strong>50%<\/strong> to <strong>80%<\/strong>. Though some <a href=\"https:\/\/www.vals.ai\/benchmarks\/lcb\" target=\"_blank\" rel=\"noreferrer noopener\">coding-oriented benchmarks<\/a> still put <strong>Gemini 3<\/strong> behind <strong>GPT-5 mini<\/strong>, <strong>GPT-5.1<\/strong>, and now also <a href=\"https:\/\/www.rdworldonline.com\/how-gpt-5-2-stacks-up-against-gemini-3-0-and-claude-opus-4-5\/\" target=\"_blank\" rel=\"noreferrer noopener\">GPT-5.2<\/a>, in our experience, Google&rsquo;s newest model truly delivers.<\/p><p>\n\n\n\n<div class=\"editor\">\n                    <h4 class=\"title\">Expert comment<\/h4>\n                    <p>&ldquo;Gemini 3 is quite capable, especially with more nuanced tasks. For example, while testing it, we were able to generate an intricate finance website with just one prompt. While accurate and powerful, Gemini 3 is rather slow. 
That is why we don&rsquo;t use it for simpler changes where a faster model can deliver a similar solution.&rdquo;<\/p>\n                    <div class=\"d-flex mt-40\">\n                        <div class=\"author-photo\">\n                            <img decoding=\"async\" src=\"https:\/\/imagedelivery.net\/LqiWLm-3MGbYHtFuUbcBtA\/wp-content\/uploads\/sites\/4\/2025\/11\/Screenshot-2025-11-18-at-17.58.26.png\/w=65,h=65,fit=scale-down\" width=\"65\" height=\"65\" class=\"border-radius-50p\" alt=\"Editor\" \/>\n                        <\/div>\n                        <div class=\"mt-auto mb-auto\">\n                            <p class=\"author-name\">Dainius Kavoli&#363;nas<\/p>\n                            <p class=\"author-position\">Head of Hostinger Horizons<\/p>\n                        <\/div>\n                    <\/div>\n                <\/div>\n\n\n\n<\/p><p><strong>Gemini 3<\/strong> is one of the LLMs powering <strong>Hostinger Horizons<\/strong>. It handles coding tasks and is paired with our <a href=\"https:\/\/www.hostinger.com\/blog\/product-updates-2025\/#:~:text=describe%2C%20and%20done.-,Communication%20agent,-.%20If%20your%20prompt\" target=\"_blank\" rel=\"noreferrer noopener\">communication agent<\/a> &ndash; a new feature that lets the AI ask clarifying questions whenever the prompt is unclear or vague. The communication agent helps Horizons understand what the user wants, which leads to more accurate code generation, an improved final result, and a smoother overall experience. Importantly, these clarifying messages are free &ndash; AI credits are required only for code changes.<\/p><h2 class=\"wp-block-heading\" id=\"h-the-newcomer-opus-4-5\">The newcomer: Opus 4.5<\/h2><p>Just days after Google released <strong>Gemini 3<\/strong>, <strong>Anthropic<\/strong> launched <strong>Claude Opus 4.5<\/strong>. 
On our internal quality score for landing page generation, this newcomer ranks among the top-performing models &ndash; right up there with the latest GPT models, as well as <strong>Gemini 3<\/strong>.<\/p><p>However, <strong>Opus 4.5<\/strong> uses more tokens than the older <strong>Claude Sonnet 4.5<\/strong> to achieve the same result.<\/p><p>&ldquo;For initial prompts, we&rsquo;re still mainly using <strong>Sonnet 4.5<\/strong>, which has proven reliable for most generation tasks. But we&rsquo;re investigating <strong>Opus 4.5<\/strong> as an alternative. It follows directions very well, doesn&rsquo;t make errors, and produces beautiful websites. Technically, it is a very powerful model,&rdquo; said Dainius Kavoli&#363;nas, Head of Hostinger Horizons.<\/p><p>The real capabilities of <strong>Opus 4.5<\/strong> show when the model is pushed to its limits &ndash; for example, by asking it to generate a comprehensive planning app with advanced color palettes, numerous buttons, gradients, and animations in one shot. Many <a href=\"https:\/\/www.datastudios.org\/post\/claude-opus-4-5-vs-claude-sonnet-4-5-full-report-and-comparison-of-features-performance-pricing-a\" target=\"_blank\" rel=\"noreferrer noopener\">benchmark scores<\/a> support this, indicating that <strong>Opus 4.5<\/strong> outperforms <strong>Sonnet 4.5<\/strong> in areas such as novel problem-solving and advanced reasoning. On <a href=\"https:\/\/www.rdworldonline.com\/how-gpt-5-2-stacks-up-against-gemini-3-0-and-claude-opus-4-5\/\" target=\"_blank\" rel=\"noreferrer noopener\">SWE-bench Verified<\/a>, a benchmark used to assess model performance on coding tasks, <strong>Opus 4.5<\/strong> slightly edges out the recent <strong>GPT-5.2 Thinking<\/strong> (<strong>80.9%<\/strong> vs. 
<strong>80%<\/strong>) and beats <strong>Gemini 3<\/strong> (<strong>76.2%<\/strong>) by a wider margin.<\/p><h2 class=\"wp-block-heading\" id=\"h-finding-the-balance\">Finding the balance<\/h2><p>By combining various AI models, we&rsquo;ve reduced the total response time of <strong>Hostinger Horizons<\/strong> by 25%. The background error check after coding now takes only 12 seconds, down from 40 seconds a month ago.<\/p><p>&ldquo;In the end, it all comes down to using the right model for the right task and in the right context. So far, we have found that <strong>Sonnet 4.5<\/strong> takes the lead in the initial prompting stage, and <strong>Gemini 3<\/strong> is optimal for subsequent fixes and adjustments, with other models invoked depending on the situation. There&rsquo;s obviously no single formula, and top scores on benchmarks don&rsquo;t guarantee the best results when LLMs are used in real-life products. Therefore, we constantly work on testing, improving, and finding the right balance to bring the best experience to our clients,&rdquo; said Kavoli&#363;nas.<\/p><p>Whether current leaders will maintain their positions or be displaced by competitors remains to be seen. But one thing is certain: we&rsquo;re intent on staying ahead by continuously testing, comparing, and optimizing. Our goal remains the same: making website creation and management as simple as possible.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The large language model (LLM) race is accelerating, with new architectures, fine-tunes, and specialized systems arriving before the last ones have even settled. 
With such intense\u2026<\/p>\n","protected":false},"author":405,"featured_media":8497,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[82],"tags":[],"hashtags":[],"class_list":["post-8495","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-engineering"],"hreflangs":[],"_links":{"self":[{"href":"https:\/\/www.hostinger.com\/blog\/wp-json\/wp\/v2\/posts\/8495","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.hostinger.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.hostinger.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.hostinger.com\/blog\/wp-json\/wp\/v2\/users\/405"}],"replies":[{"embeddable":true,"href":"https:\/\/www.hostinger.com\/blog\/wp-json\/wp\/v2\/comments?post=8495"}],"version-history":[{"count":2,"href":"https:\/\/www.hostinger.com\/blog\/wp-json\/wp\/v2\/posts\/8495\/revisions"}],"predecessor-version":[{"id":8499,"href":"https:\/\/www.hostinger.com\/blog\/wp-json\/wp\/v2\/posts\/8495\/revisions\/8499"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.hostinger.com\/blog\/wp-json\/wp\/v2\/media\/8497"}],"wp:attachment":[{"href":"https:\/\/www.hostinger.com\/blog\/wp-json\/wp\/v2\/media?parent=8495"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.hostinger.com\/blog\/wp-json\/wp\/v2\/categories?post=8495"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.hostinger.com\/blog\/wp-json\/wp\/v2\/tags?post=8495"},{"taxonomy":"hashtags","embeddable":true,"href":"https:\/\/www.hostinger.com\/blog\/wp-json\/wp\/v2\/hashtags?post=8495"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}