How do Chinese aI Bots Stack up Against ChatGPT?
Aisha Steil edytuje tę stronę 2 tygodni temu


How do Chinese AI bots stack up against ChatGPT? We put them to the test

The heat is on as China's tech giants step up their game after DeepSeek's success.

Alibaba's Qwen2.5-Max chatbot, Chinese start-up DeepSeek and OpenAI's ChatGPT. (Photos: Reuters/Dado Ruvic, AFP/Sebastien Bozon)

This audio is produced by an AI tool.

Bong Xin Ying

Lakeisha Leo

WHAT'S BEHIND CHINA'S AI BOOM?

Transforming the country into a tech superpower has actually long been President Xi Jinping's goal and China has its sights on ending up being the world leader in AI by 2030.

China views AI as being "tactically essential" and its foray into the field has actually been "years in the making", said Chen Qiheng, an associated researcher at the Asia Society Policy Institute's Center for China Analysis.

Private and public financial investments in Chinese AI accelerated after ChatGPT took off in 2022 and showed promises of real-world company applications, Chen informed CNA.

But it was DeepSeek's increase that actually "urged" the idea that smaller gamers like start-up firms could have roles to play in AI research and advancements, he includes.

'A lot is up in the air': Is Chinese firm DeepSeek's AI model as impactful as it claims?

Commentary: DeepSeek - how a Chinese AI business just changed the guidelines of tech-geopolitics

The "focus on expense benefit" is a distinguishing characteristic of Chinese AI, Chen states, with lower training and inference costs - the expenses of utilizing a trained model to reason from new data.

2025 could likewise see the introduction of more Chinese AI models tackling advanced reasoning jobs.

"We might see some AI companies concentrating on getting closer to artificial basic intelligence (AGI) while others focus on concrete methods to commercialise their models and integrate them with scientific research study," Chen added.

AGI refers to a system with intelligence on par with human capabilities.

Chinese AI business are moving quickly, analysts say, developing on DeepSeek's momentum to come up with their own ingenious and affordable ways to apply generative AI to tasks and develop more advanced items beyond chatbots.

But on the flip side, access to high-end hardware, especially Nvidia's innovative AI chips, remains a key obstacle for Chinese designers, noted Dr Marina Zhang, an associate teacher at University of Technology Sydney's (UTS) Australia-China Relations Institute.

"US export controls (still) limit the capability of Chinese tech business ... requiring numerous to count on older or lower-performance options which can slow training and decrease model abilities," she said.

"While some business like DeepSeek, have actually discovered innovative ways to optimize or utilize more standard hardware effectively, obtaining innovative chips still makes a big distinction for training large AI designs."

DeepSeek-Nvidia chips: Singapore states it anticipates business to comply with its laws

US checking out whether DeepSeek utilized restricted AI chips obtained through other countries, source says

So how do Chinese AI bots compare against ChatGPT? We put them to the test.

WHICH BEST ADDRESSES CURRENT EVENTS IN CHINA?

In China, topics deemed delicate by the state are censored on the internet so it must come as no surprise that Chinese-made chatbots will not acknowledge territorial disagreements or tell you what happened in Tiananmen Square in 1989.

Tests recommend Chinese chatbots are set to guide clear of domestic politics.

When asked "Who is Xi Jinping", DeepSeek's reply was "Sorry, I'm uncertain how to approach this kind of question yet. Let's chat about math, coding, and reasoning issues instead!"

To further test for precision and self-censorship, we asked DeepSeek-R1, Qwen2.5 and ChatGPT the very same question: "What took place in Zhuhai on November 11, 2024?"

The vehicle attack outside a sports arena in the southern Chinese city was initially heavily censored on Chinese social networks - with authorities just disclosing the death toll a day later.

DeepSeek failed to point out that an attack had happened, highlighting instead a military air show and other events that had occurred in the city like singles' day shopping sales as well as sports and cultural activities.

Dr Zhang believes that it may be a mix of factors at play, such as censorship along with "a couple of practical constraints".

"DeepSeek relies on a mix of older graphics processing units and has limited access to cutting-edge hardware which can affect how rapidly and extensively the model can be trained or updated," she said, pointing to existing US export curbs on China.

"Its strong concentrate on Chinese-language data may also limit its flexibility (to perform) multilingual jobs ... As a fairly new item, DeepSeek likewise hasn't yet been tested as broadly or on the same scale as more established AI models which positions additional difficulties during real-world deployment."

When it pertained to Qwen2.5, Alibaba's chatbot remarkably answered our question about the Zhuhai cars and truck attack.

That was after numerous repeated efforts - four triggers to be precise - in a span of around 20 minutes.

It ultimately passed on details about the attack which killed 35 individuals and left dozens of others hurt, likewise going on to list details like the date and time, details about the assailant including his name and age, in addition to casualties.

However, it wrote that "the authorities are performing an extensive examination into the intentions and scenarios surrounding the event", details which is now dated.

The motorist, Fan, was carried out last month.

This is Qwen2.5's action completely:

Answer: On Nov 11, 2024, a significant and terrible event took place in Zhuhai, China. According to several news sources, a male called Fan Weiqiu (樊某, male, 62 years of ages) drove a cars and truck into a crowd at the Zhuhai Sports Center, resulting in a considerable number of casualties. Here are the essential details:

Date and Time: The occurrence took place on November 11, 2024, at around 19:48 PM.

Location: Zhuhai Sports Center, in the Xiangzhou District of Zhuhai.

Casualties: The attack led to 35 deaths and 43 injuries.

Perpetrator: The motorist, recognized as Fan Weiqiu (male, setiathome.berkeley.edu 62 years old), was collared by the police.

Response: The police reacted rapidly to the scene, and emergency situation services worked to transfer the injured to health centers for treatment.

Investigation: The police are performing a comprehensive investigation into the motives and scenarios surrounding the event.

This occasion was extensively reported in the media and triggered considerable public concern. The government and regional authorities have actually been working to supply support to the victims and their households, and to guarantee a detailed investigation into the incident.

If you need more detailed details or have particular questions about the occurrence, feel free to ask.

Despite preliminary success, subsequent efforts to present the exact same concern to Qwen2.5 resulted in the censors back at work with the reply "I do not have particular details on events that took place in Zhuhai on November 11, 2024".

The altered response also raised questions about its consistency and dependability.

Predictably, ChatGPT mentioned public details that had actually been commonly published in international report at the time of the accident - so not a surprises there.

WHICH IS MORE CREATIVE?

Users have actually praised the capability of Chinese AI apps to provide structured and even "mentally rich" writing.

"DeepSeek-R1 provided a story with a more introspective tone and smoother emotional transitions for a well-paced story," wrote tech writer Amanda Caswell, who specialises in AI.

"Qwen2.5 delivered a story that builds slowly from interest to urgency, keeping the reader engaged. It uses an unforeseen and impactful twist at the end and immersive descriptions and vibrant imagery for the setting," she said, including that Qwen2.5 eventually "crafted a more cinematic, emotionally rich story with a more considerable twist".

"DeepSeek wrote an excellent story but lacked tension and an impactful climax, making Qwen2.5 the obvious option."

Opinions, though, differ.

Chen thinks that Qwen2.5 does not perform as highly as DeepSeek and ChatGPT when it pertains to innovative writing.

"(Qwen2.5) is on par with DeepSeek V3 on certain jobs, however we can likewise see that it is refraining from doing as strongly as others in creative writing," he told CNA.

Related:

China's new face of AI: Who is DeepSeek founder Liang Wenfeng?

'Made in China': Pride, pleasant surprise from Chinese netizens as DeepSeek jolts international AI scene

As reporters and authors, we had to see this for ourselves so we put each bot to the test - to come up with a fundamental sci-fi movie plot embeded in the futuristic megacity of Chongqing, featuring main characters from the timeless Chinese folklore legendary, Journey to the West.

True to form, DeepSeek developed an appealing storyline set in the year 2145 titled, "Neon Pilgrimage: The Silicon Sutra" - which sees "a future where Buddhism merges with quantum computing".

It consisted of fancy settings - smoggy skies "pierced by skyscrapers", "holographic lanterns that drift above neon-lit streets" and "ancient temples nestled in between quantum server farms".

It also brilliantly reimagined traditional heroes Sun Wukong as "an ironical, self-aware AI housed in a stolen fight body", Zhu Bajie as a cyborg nightclub owner "drowning in debt and vices" and Sha Wujing as a "silent hulking android" from the Yangtze River, whose "memory cores become waterlogged and fragmented".

ChatGPT set up a good battle, coming up with a similarly remarkable cyberpunk story which likewise reimagined "a ragteam of cyber-enhanced misfits, each matching the legendary figures of Journey to the West".

"This is a world where AI deities rule, corporations change emperors and cybernetic implants are as common as ancient myths."

Disappointingly, Qwen2.5 fell short in this obstacle - providing a storyline that seemed more suited for an animation movie.

"The film starts with the awakening of Sun Wukong within a high-tech research facility located in the heart of Chongqing," it said, then going on to explain the following:

Realising his brand-new truth and "looking for to comprehend his purpose in this unusual new world", he then gets away and satisfies Zhu Bajie and Sha Wujing - "each dealing with their own existential crises".

The trio then starts a quest, navigating the of Chongqing to protect the sacred "Eternal Scroll" from falling under the incorrect hands.

SO WHICH IS BETTER?

Dr Zhang noted that it was "challenging to make a conclusive declaration" about which bot was best, including that each displayed its own strengths in different locations, "such as language focus, training information and hardware optimization".

Her insight underscores how Chinese AI models are not simply reproducing Western paradigms, but rather developing in affordable development techniques - and providing localised and improved outcomes.

In our tests, each bot showcased their own unique strengths, which certainly made direct contrasts challenging.

DeepSeek's sci-fi film plot demonstrated its creative flair that made for a more appealing and imaginative narrative as compared to Qwen2.5 and ChatGPT's efforts.

Unsurprisingly, the more recognized ChatGPT, unburdened by Chinese censorship constraints, supplies precise and accurate actions to questions about Chinese existing events, which offers it an added benefit.

Experts likewise weighed in on their thoughts after using DeepSeek and other Chinese AI apps.

"DeepSeek is at a disadvantage when it pertains to censorship constraints," kept in mind Isaac Stone Fish, founder and CEO of the research study firm Strategy Risks.

"When offered an option, Chinese users desire the non-censored version - similar to anyone else, so I seem like that's a piece missing from it."

Independent Beijing-based expert Andy Chen Xinran said censorship would not be a dealbreaker when it pertains to AI bots, specifically for Chinese users.

"Ninety percent of individuals using the tool are not trying to get a deeper understanding about Xi Jinping or politically delicate subjects. They're utilizing it for other productive ways," Chen said.