What risk is there that generative AI evolves such that the content it generates can avoid detection faster than tools can be developed to detect it? How can international and industry collaboration limit this risk?
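
The core dynamic behind this question is an arms race: a generator can adapt to a deployed detector faster than the detector can be retrained and redeployed. The toy simulation below sketches that lag on a single synthetic "detectability" score; the distributions, parameters and the score itself are illustrative assumptions, not any real detector or DSIT tool.

```python
# A toy simulation of the generator-vs-detector arms race, not a real
# detector. Each round the detector retrains on the data it saw this
# round, but the update only takes effect next round (a development
# lag), while the generator shifts its output distribution immediately.
# All distributions and parameters are assumptions made for illustration.
import numpy as np

rng = np.random.default_rng(0)
n = 100_000        # samples per round
gen_mean = 3.0     # synthetic content starts clearly separable from human
threshold = 1.0    # initial detector, trained offline before deployment

for round_no in range(6):
    human = rng.normal(0.0, 1.0, n)           # human-written content scores
    synthetic = rng.normal(gen_mean, 1.0, n)  # AI-generated content scores

    recall = (synthetic > threshold).mean()   # synthetic content caught
    false_pos = (human > threshold).mean()    # human content mis-flagged
    print(f"round {round_no}: recall={recall:.2f}, false_pos={false_pos:.2f}")

    # Detector retrains on this round's data; the new threshold only
    # applies from the next round onwards (tool-development lag).
    threshold = (human.mean() + synthetic.mean()) / 2

    # Generator adapts immediately, moving toward the human distribution.
    gen_mean = max(0.0, gen_mean - 0.6)
```

Once the two score distributions converge, no threshold separates them: the detector must trade missed synthetic content against mis-flagged human content. That is one concrete way of stating the risk, and one reason shared detection standards and data, developed through international and industry collaboration, may shorten the retraining lag that isolated tool development cannot.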

Background

Although there is already material evidence on the types of serious harms individuals encounter online, a number of emerging harms remain for which the evidence base has yet to mature (e.g. epilepsy trolling, online animal abuse). SOH would like to close this significant gap in understanding the impact of encountering different types of serious harm online, and to identify the best approaches to measuring the impact of the Online Safety legislation.

SOH highlights the importance of Media Literacy in the digital age and asks for further studies to uncover barriers to engagement, as well as to assess the effectiveness of DSIT programmes. This closely relates to Counter-Disinformation interventions, which require evidence on their effect on bystanders, on topic-specific disinformation, and on the tools that can be used to combat it.

Research on Safety Technology would greatly develop SOH's understanding of how DSIT's online safety objectives relate to today's technology market. A primary focus is improving Age Assurance (AA) measures, including ensuring transparency and assessing opportunities for the sector.

Next steps

If you are keen to register your interest in working and connecting with the DSIT Digital Technology and Telecoms Group, and/or in submitting evidence, please complete the DSIT-ARI Evidence survey: https://dsit.qualtrics.com/jfe/form/SV_cDfmK2OukVAnirs.
Full details are available at: https://www.gov.uk/government/publications/department-for-science-innovation-and-technology-areas-of-research-interest/dsit-areas-of-research-interest-2024

Source

This question was published as part of the set of ARIs in this document:

DSIT Areas of Research Interest 2024 (GOV.UK)

Related UKRI funded projects

  • Safe Internet surfing with an intelligent child-centred shield against harmful content

    The Internet exposes children to malicious content with a direct impact on their safety: illicit, violent and pornographic material, to name a few. The Internet is also an enabler of cyber victimisation, such as ...

    Funded by: Innovate UK

    Lead research organisation: UNIVERSITY OF WOLVERHAMPTON

    Why might this be relevant?

    The project focuses on developing a child-centred shield against harmful online content, directly addressing the risk of generative AI creating harmful content.

  • Shear: The next generation of video understanding technology to automate content moderation across the internet.

    In this project, Unitary Ltd and Oxford University will develop novel algorithms to address the core challenges of video moderation. This technology will form Unitary's new product, Shear, to automatically detect harmful ...

    Funded by: Innovate UK

    Lead research organisation: UNITARY LTD

    Why might this be relevant?

    The project aims to automate content moderation for harmful videos, which is directly related to the risk of generative AI creating undetectable content.

  • Detox: Human-led AI to automate and radically improve online content moderation

    In this project, Rewire Online Limited will develop and commercially trial a new Artificial Intelligence (AI)-powered product, Detox, which massively improves how platforms moderate online content. Detox will automatically ...

    Funded by: Innovate UK

    Lead research organisation: REWIRE ONLINE LIMITED

    Why might this be relevant?

    The project focuses on developing AI-powered content moderation to tackle toxic online content, aligning with the need to detect harmful content generated by AI.

Similar ARIs from other organisations