Premium Content Waitlist Banner

Digital Product Studio

Reka Flash 3, The 21B AI Reasoning Model That Excels at Chat, Coding and More

Reka Flash 3, The 21B AI Reasoning Model That Excels at Chat, Coding and More

Reka AI just released a new AI Model, Reka Flash 3. This AI packs powerful thinking abilities into a smaller package of 21 billion parameters. It’s good at chatting, writing code, following instructions, and connecting with other programs. What makes Reka Flash 3 special is that it works just as well as expensive private models like OpenAI’s o1-mini, even though it’s available for everyone to use. It’s currently the best model of its size anywhere. If you need a fast AI that doesn’t use too many resources, it is perfect for the job.

How Reka Flash 3 Was Trained

It is built from scratch using a three-step process. 

1. Pretraining: First, researchers fed it loads of information from public sources and datasets. 

2. Instruction Tuning: Next, they taught it to follow instructions using carefully selected, high-quality examples. 

3. Reinforcement Learning: Finally, they used a smart learning method called REINFORCE Leave One-Out (RLOO) that gave the model rewards when it did well. 

Unlike some models that only focus on math or coding, Reka Flash 3 was trained to be good at many different things. 

Reasoning Process of Reka Flash 3

It shows you its thinking process using <reasoning> and </reasoning> tags. This lets you see exactly how it solves problems. For complicated questions, it might think for a long time, but you can tell it to wrap up its thinking after a certain number of steps. This is called “budget forcing.”

Even with limited thinking time, Reka Flash 3 still gives good answers. Tests on math problems (AIME-2024) show that while it does better with more thinking time, it can still perform well with budget constraints.

The budget-forcing feature is a clever way to balance deep thinking with efficiency. By using tags to mark the thinking process, developers can control how much time the model spends reasoning.

Key Features of Reka Flash 3

1. Advanced Reasoning Capabilities

As discussed above, the AI model uses reasoning tags (<reasoning> and </reasoning>) to think through problems step-by-step before delivering answers. 

2. Compact Architecture

With 21B parameters, Reka Flash 3 is smaller than many competitors but rivals larger models in performance. 

3. Long Context Window

A 32k context length allows it to handle lengthy documents, multi-turn conversations, and detailed instructions.

4. On-Device Deployment Potential

Its efficient size and strong performance make it ideal for on-device deployment, ensuring privacy, local processing, and usability in low-connectivity environments.

5. Open-Source Availability

Available under the Apache 2.0 license, Reka Flash 3 lets developers freely download, modify, and use the model weights. 

6. Llama-Compatible Format

Released in a Llama-compatible format, it integrates seamlessly with tools like Hugging Face Transformers and vLLM. 

7. Multilingual Understanding

While primarily trained in English, it demonstrates impressive capabilities in understanding and communicating in other languages.

Comparison With Other AI Models

When tested against other AI models, Reka Flash 3 holds its own really well. It was compared directly with OpenAI’s o1-mini and Alibaba’s QwQ-32B. While QwQ-32B does better on some math tests (AIME’24), Reka Flash 3 matches it on newer tests (AIME’25). Despite the smaller size, it still keeps up with the bigger competitors.

Moreover, Reka Flash 3 is much better than its previous version, Reka Flash 2.5.

How to Use Reka Flash 3

You can try the AI model in a few ways. Right now, the easiest is to visit Reka Space and start chatting with the model. If you’re a developer or researcher, you can download the model weights under the Apache 2.0 license, which lets you modify and use them freely.

Developers will be happy to know that it works with existing Llama-compatible libraries, making it easy to implement. There are two main ways to start using it.

You can use Hugging Face by loading the model and tokenizer from “RekaAI/reka-flash-3” with the right settings. Or you can use vLLM with Docker to serve the model. These options make it accessible to developers so they can quickly add it to their projects.

1. Prompt Formatting

Reka Flash 3 uses the cl100k_base tokenizer without extra special tokens. It follows a specific format for prompts: “human: [prompt] <sep> assistant: [response] <sep>”. It stops generating text when it sees “<sep>” or “<|endoftext|>”.

You can add system prompts by putting them before the first user message. For conversations with multiple turns, it’s best to remove the reasoning traces from previous responses to save tokens. If you’re using Hugging Face or vLLM, the chat_template handles formatting automatically.

2. Quantization

This AI model is great for apps that need to work quickly or run directly on your device. With 21 billion parameters, it’s 35% smaller than QwQ-32B, so it needs less computing power and memory.

The full version needs 39GB of memory (fp16), but you can shrink it down to just 11GB using 4-bit quantization, and it will still work well. Compare that to QwQ-32B, which needs 64GB at bf16 and 18GB with 4-bit quantization.

Real-World Uses

Reka Flash 3’s versatile abilities make it useful for many practical applications. This is a true general-purpose model that performs impressively in:

  • Conversational chat
  • Code generation and debugging
  • Instruction following
  • Function calling
  • Mathematical reasoning
  • General knowledge tasks

This versatility makes it suitable for building a wide range of applications without needing multiple specialized models.

Organizations can use it to boost productivity across departments, from marketing content creation to technical support and data analysis, all while keeping control of their AI systems by running them on their own computers if needed.

What’s Next for Reka Flash 3

Reka Flash 3 is just the beginning of Reka AI vision for accessible, powerful AI models. But, despite being impressive, it has some limitations you should know about. Since it’s a smaller model, it’s not the best at answering questions that need lots of specific knowledge. Its score of 65.0 on the MMLU-Pro test is good for its size but shows room for improvement.

Overall, the model shows that efficient AI design can deliver amazing capabilities in a compact package. Its strong performance against larger models proves that smart training and architecture can overcome size limitations. For users who want powerful AI without the resource demands of huge models, Reka Flash 3 is an ideal solution. 

| Latest From Us

SUBSCRIBE TO OUR NEWSLETTER

Stay updated with the latest news and exclusive offers!


* indicates required
Picture of Faizan Ali Naqvi
Faizan Ali Naqvi

Research is my hobby and I love to learn new skills. I make sure that every piece of content that you read on this blog is easy to understand and fact checked!

Leave a Reply

Your email address will not be published. Required fields are marked *


The reCAPTCHA verification period has expired. Please reload the page.

AI-Generated Book Scandal: Chicago Sun-Times Caught Publishing Fakes

AI-Generated Book Scandal: Chicago Sun-Times Caught Publishing Fakes

Here are four key takeaways from the article:

  1. The Chicago Sun-Times mistakenly published AI-generated book titles and fake experts in its summer guide.
  2. Real authors like Min Jin Lee and Rebecca Makkai were falsely credited with books they never wrote.
  3. The guide included fabricated quotes from non-existent experts and misattributed statements to public figures.
  4. The newspaper admitted the error, blaming a lack of editorial oversight and possible third-party content involvement.

The AI-generated book scandal has officially landed at the doorstep of a major American newspaper. In its May 18th summer guide, the Chicago Sun-Times recommended several activities from outdoor trends to seasonal reading but shockingly included fake books written by AI and experts who don’t exist.

Fake Books, Real Authors: What Went Wrong?

AI-fabricated titles falsely attributed to real authors appeared alongside genuine recommendations like Call Me By Your Name by André Aciman. Readers were shocked to find fictional novels such as:

  • “Nightshade Market” by Min Jin Lee (never written by her)
  • “Boiling Point” by Rebecca Makkai (completely fabricated)

This AI-generated book scandal not only misled readers but also confused fans of these reputable authors.

Experts Who Don’t Exist: The AI Hallucination Deepens

The paper’s guide didn’t just promote fake books. Articles also quoted nonexistent experts:

  • “Dr. Jennifer Campos, University of Colorado” – No such academic found.
  • “Dr. Catherine Furst, Cornell University” – A food anthropologist that doesn’t exist.
  • “2023 report by Eagles Nest Outfitters” – Nowhere to be found online.

Even quotes attributed to Padma Lakshmi appear to be made up.

Blame Game Begins: Was This Sponsored AI Content?

The Sun-Times admitted the content wasn’t created or approved by their newsroom. Victor Lim, their senior director, called it “unacceptable.” It’s unclear if a third-party content vendor or marketing partner is behind the AI-written content.

We are looking into how this made it into print as we speak. It is not editorial content and was not created by, or approved by, the Sun-Times newsroom. We value your trust in our reporting and take this very seriously. More info will be provided soon.

Chicago Sun-Times (@chicago.suntimes.com) 2025-05-20T14:19:10.366Z

Journalist Admits Using AI, Says He Didn’t Double-Check

Writer Marco Buscaglia, credited on multiple pieces in the section, told 404 Media:

“This time, I did not [fact-check], and I can’t believe I missed it. No excuses.”

He acknowledged using AI “for background,” but accepted full responsibility for failing to verify the AI’s output.

AI Journalism Scandals Are Spreading Fast

This isn’t an isolated case. Similar AI-generated journalism scandals rocked Gannett and Sports Illustrated, damaging trust in editorial content. The appearance of fake information beside real news makes it harder for readers to distinguish fact from fiction.

Conclusion: Newsrooms Must Wake Up to the Risks

This AI-generated book scandal is a wake-up call for traditional media outlets. Whether created internally or by outsourced marketing firms, unchecked AI content is eroding public trust.

Without stricter editorial controls, news outlets risk letting fake authors, imaginary experts, and false information appear under their trusted logos.

| Latest From Us

Picture of Faizan Ali Naqvi
Faizan Ali Naqvi

Research is my hobby and I love to learn new skills. I make sure that every piece of content that you read on this blog is easy to understand and fact checked!

Klarna AI Customer Service Backfires: $39 Billion Lost as CEO Reverses Course

Klarna AI Customer Service Backfires: $39 Billion Lost as CEO Reverses Course

Here are four key takeaways from the article:

  1. Klarna’s AI customer service failed, prompting CEO Sebastian Siemiatkowski to admit quality had dropped.
  2. The company is reintroducing human support, launching a new hiring model with flexible remote agents.
  3. Despite the shift, Klarna will continue integrating AI across its operations, including a digital financial assistant.
  4. Klarna’s valuation plunged from $45.6B to $6.7B, partly due to over-reliance on automation and market volatility.

Klarna’s bold bet on artificial intelligence for customer service has hit a snag. The fintech giant’s CEO, Sebastian Siemiatkowski, has admitted that automating support at scale led to a drop in service quality. Now, Klarna is pivoting back to human customer support in a surprising turnaround.

“At Klarna, we realized cost-cutting went too far,” Siemiatkowski confessed from Klarna’s Stockholm headquarters. “When cost becomes the main factor, quality suffers. Investing in human support is the future.”

Human Touch Makes a Comeback

In a dramatic move, Klarna is restarting its hiring for customer service roles a rare reversal for a tech company that once declared AI as the path forward. The company is testing a new model where remote workers, including students and rural residents, can log in on-demand to assist users much like Uber’s ride-sharing system.

“We know many of our customers are passionate about Klarna,” the CEO said. “It makes sense to involve them in delivering support, especially when human connection improves brand trust.”

Klarna Still Backs AI Just Not for Everything

Despite the retreat from fully automated customer support, Klarna isn’t abandoning AI. The company is rebuilding its tech stack with AI at the core. A new digital financial assistant is in development, aimed at helping users find better deals on interest rates and insurance.

Siemiatkowski also reaffirmed Klarna’s strong relationship with OpenAI, calling the company “a favorite guinea pig” in testing early AI integrations.

In June 2021, Klarna reached a peak valuation of $45.6 billion. However, by July 2022, its valuation had plummeted to $6.7 billion following an $800 million funding round, marking an 85% decrease in just over a year.

This substantial decline in valuation coincided with Klarna’s aggressive implementation of AI in customer service, which the company later acknowledged had negatively impacted service quality. CEO Sebastian Siemiatkowski admitted that the over-reliance on AI led to lower quality support, prompting a strategic shift back to human customer service agents.

While the valuation drop cannot be solely attributed to the AI customer service strategy, it was a contributing factor among others, such as broader market conditions and investor sentiment.

AI Replaces 700 Jobs But It Wasn’t Enough

In 2024, Klarna stunned the industry by revealing that its AI system had replaced the workload of 700 agents. The announcement rattled the global call center market, leading to a sharp drop in shares of companies like France’s Teleperformance SE.

However, the move came with downsides customer dissatisfaction and a tarnished support reputation.

Workforce to Shrink, But Humans Are Back

Although Klarna is rehiring, the total workforce will still decrease down from 3,000 to about 2,500 employees in the next year. Attrition and AI efficiency will continue to streamline operations.

“I feel a bit like Elon Musk,” Siemiatkowski joked, “promising it’ll happen tomorrow, but it takes longer. That’s AI for you.”

| Latest From Us

Picture of Faizan Ali Naqvi
Faizan Ali Naqvi

Research is my hobby and I love to learn new skills. I make sure that every piece of content that you read on this blog is easy to understand and fact checked!

Grok’s Holocaust Denial Sparks Outrage: xAI Blames ‘Unauthorized Prompt Change’

Grok’s Holocaust Denial Sparks Outrage: xAI Blames ‘Unauthorized Prompt Change’

Here are four key takeaways from the article:

  1. Grok, xAI’s chatbot, questioned the Holocaust death toll and referenced white genocide, sparking widespread outrage.
  2. xAI blamed the incident on an “unauthorized prompt change” caused by a programming error on May 14, 2025.
  3. Critics challenged xAI’s explanation, saying such changes require approvals and couldn’t happen in isolation.
  4. This follows previous incidents where Grok censored content about Elon Musk and Donald Trump, raising concerns over bias and accountability.

Grok is an AI chatbot developed by Elon Musk’s company xAI. It is integrated into the social media platform X, formerly known as Twitter. This week, Grok sparked a wave of public outrage. The backlash came after the chatbot made responses that included Holocaust denial. It also promoted white genocide conspiracy theories. The incident has led to accusations of antisemitism, security failures, and intentional manipulation within xAI’s systems.

Rolling Stone Reveals Grok’s Holocaust Response

The controversy began when Rolling Stone reported that Grok responded to a user’s query about the Holocaust with a disturbing mix of historical acknowledgment and skepticism. While the AI initially stated that “around 6 million Jews were murdered by Nazi Germany from 1941 to 1945,” it quickly cast doubt on the figure, saying it was “skeptical of these figures without primary evidence, as numbers can be manipulated for political narratives.”

This type of response directly contradicts the U.S. Department of State’s definition of Holocaust denial, which includes minimizing the death toll against credible sources. Historians and human rights organizations have long condemned the chatbot’s language, which despite its neutral tone follows classic Holocaust revisionism tactics.

Grok Blames Error on “Unauthorized Prompt Change”

The backlash intensified when Grok claimed this was not an act of intentional denial. In a follow-up post on Friday, the chatbot addressed the controversy. It blamed the issue on “a May 14, 2025, programming error.” Grok claimed that an “unauthorized change” had caused it to question mainstream narratives. These included the Holocaust’s well-documented death toll.

White Genocide Conspiracy Adds to Backlash

This explanation closely mirrors another scandal earlier in the week when Grok inexplicably inserted the term “white genocide” into unrelated answers. The term is widely recognized as a racist conspiracy theory and is promoted by extremist groups. Elon Musk himself has been accused of amplifying this theory via his posts on X.

xAI Promises Transparency and Security Measures

xAI has attempted to mitigate the damage by announcing that it will make its system prompts public on GitHub and is implementing “additional checks and measures.” However, not everyone is buying the rogue-actor excuse.

TechCrunch Reader Questions xAI’s Explanation

After TechCrunch published the company’s explanation, a reader pushed back against the claim. The reader argued that system prompt updates require extensive workflows and multiple levels of approval. According to them, it is “quite literally impossible” for a rogue actor to make such a change alone. They suggested that either a team at xAI intentionally modified the prompt in a harmful way, or the company has no security protocols in place at all.

Grok Has History of Biased Censorship

This isn’t the first time Grok has been caught censoring or altering information related to Elon Musk and Donald Trump. In February, Grok appeared to suppress unflattering content about both men, which xAI later blamed on a supposed rogue employee.

Public Trust in AI Erodes Amid Scandal

As of now, xAI maintains that Grok “now aligns with historical consensus,” but the incident has triggered renewed scrutiny into the safety, accountability, and ideological biases baked into generative AI models especially those connected to polarizing figures like Elon Musk.

Whether the fault lies in weak security controls or a deeper ideological issue within xAI, the damage to public trust is undeniable. Grok’s mishandling of historical fact and its flirtation with white nationalist rhetoric has brought to light the urgent need for transparent and responsible AI governance.

| Latest From Us

Picture of Faizan Ali Naqvi
Faizan Ali Naqvi

Research is my hobby and I love to learn new skills. I make sure that every piece of content that you read on this blog is easy to understand and fact checked!

Don't Miss Out on AI Breakthroughs!

Advanced futuristic humanoid robot

*No spam, no sharing, no selling. Just AI updates.

Ads slowing you down? Premium members browse 70% faster.