Don't expect quick fixes in 'red-teaming' of AI models. Security was an afterthought

@girlfreddy@lemmy.ca to Technology@beehaw.org
•
apnews.com
•
1Y
•

Don’t expect quick fixes in ‘red-teaming’ of AI models. Security was an afterthought

@girlfreddy@lemmy.ca to Technology@beehaw.org
•
apnews.com
•
1Y
•

White House officials concerned about AI chatbots' potential for societal harm and the Silicon Valley powerhouses rushing them to market are heavily invested in a three-day competition ending Sunday at the DefCon hacker convention in Las Vegas.

Current AI models are simply too unwieldy, brittle and malleable, academic and corporate research shows. Security was an afterthought in their training as data scientists amassed breathtakingly complex collections of images and text. They are prone to racial and cultural biases, and easily manipulated.

You must log in or register to comment.

HotTopNewOld

Chat

@Ubermeisters@lemmy.zip

6•1Y

Are they going to ‘red-team’ away adversarial prompting as well? Doubt it. Sooooo the issue is the input data. Always has been.

Technology

!technology@beehaw.org

Create a post

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !technology@beehaw.org

A nice place to discuss rumors, happenings, innovations, and challenges in the technology sphere. We also welcome discussions on the intersections of technology and society. If it’s technological news or discussion of technology, it probably belongs here.

Remember the overriding ethos on Beehaw: Be(e) Nice. Each user you encounter here is a person, and should be treated with kindness (even if they’re wrong, or use a Linux distro you don’t like). Personal attacks will not be tolerated.

Subcommunities on Beehaw:

This community’s icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

1 user online
60 users / day
245 users / week
568 users / month
2.51K users / 6 months
1 subscriber
3.17K Posts
65.7K Comments
Modlog