
The dataset is even pre-formatted for machine learning.

Nova Sonic can detect your tone.
WordPress.com launches a free AI-powered website builder
WordPress.com has launched a new AI site builder that allows anyone to create a functioning website using an AI chat-style interface.
DeepMind: An Approach to Technical AGI Safety and Security.
We have written a paper on our approach to technical AGI safety and security. This post is primarily a copy of the extended abstract, which summarizes…
Artificial General Intelligence (AGI) promises transformative benefits but also presents significant risks. We develop an approach to address the risk of harms consequential enough to significantly harm humanity. We identify four areas of risk: misuse, misalignment, mistakes, and structural risks. Of these, we focus on technical approaches to misuse and misalignment. For misuse, our strategy aims to prevent threat actors from accessing dangerous capabilities, by proactively identifying dangerous capabilities, and implementing robust security, access restrictions, monitoring, and model safety mitigations. To address misalignment, we outline two lines of defense. First, model-level mitigations such as amplified oversight and robust training can help to build an aligned model. Second, system-level security measures such as monitoring and access control can mitigate harm even if the model is misaligned. Techniques from interpretability, uncertainty estimation, and safer design patterns can…
An AI avatar tried to argue a case before a New York court. The judges weren't having it.
A man appearing before a New York court got a scolding from a judge after he tried to use an avatar generated by artificial intelligence to argue his case.
The Llama 4 herd: The beginning of a new era of natively multimodal AI innovation.
We're introducing Llama 4 Scout and Llama 4 Maverick, the first open-weight natively multimodal models with unprecedented context support and our first built using a mixture-of-experts (MoE) architecture.
- We're sharing the first models in the Llama 4 herd, which will enable people to build more personalized multimodal experiences.
- Llama 4 Scout, a 17-billion-active-parameter model with 16 experts, is the best multimodal model in the world in its class and more powerful than all previous-generation Llama models, while fitting on a single H100 GPU. Additionally, Llama 4 Scout offers an industry-leading context window of 10M tokens and delivers better results than Gemma 3, Gemini 2.0 Flash-Lite, and Mistral 3.1 across a broad range of widely reported benchmarks.
- Llama 4 Maverick, a 17-billion-active-parameter model with 128 experts, is the best multimodal model in its class, beating GPT-4o and Gemini 2.0 Flash across a broad range of widely reported benchmarks, while achieving results comparable to the new DeepSeek v3 on reasoning and coding, at less than half the active parameters. Llama 4 Maverick offers a best-in-class performance-to-cost ratio with an experimental chat version s…
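The "active parameter" framing in the bullets above can be made concrete with a toy mixture-of-experts layer: a router scores every expert for each token, only the top-k experts actually run, and their outputs are combined by the router's softmax weights, so per-token compute scales with active (not total) parameters. This is an illustrative sketch only, not Meta's implementation; all names (`moe_forward`, `gate_weights`, the dimensions) are hypothetical.

```python
import numpy as np

def moe_forward(x, expert_weights, gate_weights, top_k=2):
    """Route one token vector through the top-k experts of an MoE layer.

    Only the selected experts compute, so the 'active' parameter count per
    token is a small fraction of the total parameter count.
    """
    logits = x @ gate_weights                     # router score per expert
    top = np.argsort(logits)[-top_k:]             # indices of the top-k experts
    weights = np.exp(logits[top])                 # softmax over the chosen experts
    weights /= weights.sum()
    # weighted sum of only the chosen experts' outputs
    return sum(w * (x @ expert_weights[e]) for w, e in zip(weights, top))

rng = np.random.default_rng(0)
d, n_experts = 8, 16
experts = rng.standard_normal((n_experts, d, d))  # 16 experts, only 2 run per token
gate = rng.standard_normal((d, n_experts))
x = rng.standard_normal(d)
y = moe_forward(x, experts, gate)
print(y.shape)  # (8,)
```

With 16 experts and top-2 routing, roughly 2/16 of the expert parameters are exercised per token, which is the sense in which a model can have 17B "active" parameters while its total parameter count is much larger.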
The new tariff plan is confusing, and the tech industry is scrambling to make sense of it.
Leaked data exposes a Chinese AI censorship machine
One academic who reviewed the dataset said it was "clear evidence" that China, or its affiliates, wants to use AI to improve repression.
cross-posted from: https://lemmy.sdf.org/post/31892983
TLDR:
- China has developed an Artificial Intelligence (AI) system that adds to its already powerful censorship machine, scanning content for a wide range of topics, including corruption, military issues, Taiwan politics, and satire
- The discovery was accidental: security researchers found an unsecured Elasticsearch database exposed on the web, hosted by the Chinese company Baidu
- Experts highlight that AI-driven censorship is evolving to make state control over public discourse even more sophisticated, especially after recent releases like China's AI model DeepSeek
A complaint about poverty in rural China. A news report about a corrupt Communist Party member. A cry for help about corrupt cops shaking down entrepreneurs.
These are just a few of the 133,000 examples fed into a sophisticated…
This blog post compares the coding capabilities of the new Gemini 2.5 Pro Experimental and Claude 3.7 Sonnet (thinking).
Is Cursor done?
With Cline + Gemini 2.5 Pro, you get the exact same feature set that Cursor and Windsurf provide. They only call the APIs of the big LLM providers, without any advanced secret sauce.
It's even the opposite: they worsen model performance by limiting context size. Their key advantage, fixed monthly costs instead of variable API usage, is now gone with Gemini 2.5 Pro…
What is left that justifies their ridiculous valuation atm?
The country poured billions into AI infrastructure, but the data center gold rush is unraveling as speculative investments collide with weak demand and DeepSeek shifts AI trends.
We surveyed 730 coders and developers about how (and how often) they use AI chatbots on the job. The results amazed and disturbed us.
The main point is that, for a free society, digital literacy matters more than ever.
The evidence-backed model delivered impressive results, but it doesn't validate the wave of AI therapy bots flooding the market.
OpenAI's model behavior lead says OpenAI is "shifting from blanket refusals in sensitive areas to a more precise approach focused on preventing real-world harm".
Joanne Jang leads model behavior at OpenAI. Their release of GPT-4o image generation included some notable relaxation of OpenAI's policies concerning acceptable usage - I [noted some of those](https://simonwillison.net/2025/Mar/25/introducing-4o-image-generation/) the …
Gemini 2.5: Our most intelligent AI model
Gemini 2.5 is our most intelligent AI model, now with thinking.
DeepSeek AI model can easily be breached for malware, security researcher Tenable warns
Tenable Research examines DeepSeek R1 and its capability to develop malware, such as a keylogger and ransomware. We found it provides a useful starting point, but requires additional prompting and debugging.
cross-posted from: https://lemmy.sdf.org/post/31583546
Security researcher Tenable successfully used DeepSeek to create a keylogger that could hide an encrypted log file on disk as well as develop a simple ransomware executable.
At its core, DeepSeek can create the basic structure for malware. However, it is not capable of doing so without additional prompt engineering as well as manual code editing for more advanced features. For instance, DeepSeek struggled with implementing process hiding. "We got the DLL injection code it had generated working, but it required lots of manual intervention," Tenable writes in its report.
**"Nonetheless, DeepSeek provides a useful compilation of techniques and search terms that can help someone with no prior experience in writing malicious code the ability to quickly famil…"**
Trust Report DeepSeek R1: "Critical levels of risk with security and ethics, high levels of risk with privacy, stereotype, toxicity, hallucination, and fairness"
Discover the VIJIL Trust Report for DeepSeek R1, a comprehensive evaluation of security, ethics, privacy, hallucination, and performance risks in this large language model (LLM). Our analysis identifies critical security and ethical risks, high privacy vulnerabilities, and moderate hallucination ris...
cross-posted from: https://lemmy.sdf.org/post/31552333
A Trust Report for DeepSeek R1 by VIJIL, a security research company, indicates critical levels of risk for security and ethics; high levels of risk for privacy, stereotype, toxicity, hallucination, and fairness; a moderate level of risk for performance; and a low level of risk for robustness.