老王论坛 Project Receives $1.6 Million Contract to Develop System for Authorship Attribution and Anonymization
AUTHOR aims to use natural language processing and machine learning for identification, while also allowing for robust anonymization in critical contexts
 
      CHICAGO鈥擬ay 18, 2023鈥擱esearchers at Illinois Institute of Technology have secured a $1.6 million contract to develop a groundbreaking system for authentic authorship attribution and anonymization. Using natural language processing and machine learning, the program, known as AUTHOR, promises to create 鈥渟tylistic fingerprints鈥 for reliable identification, while also providing robust solutions for anonymization. With broad applications including counterintelligence, combating misinformation, and even investigating the origins of ancient religious texts, the project marks a significant leap in computational analysis.
The project鈥攁 collaboration with Charles River Analytics, Rensselaer Polytechnic Institute, Aston University, and the Howard Brain Sciences Foundation鈥攈as received the funding from an $11.3 million pool allocated by the program of the , a research organization within the Office of the Director of National Intelligence.
AUTHOR (Attribution, and Undermining the Attribution, of Text While Providing Human-Oriented Rationales) aims to accurately capture the unique writing styles of authors through a sophisticated blend of natural language processing and machine learning. The project is being led by 老王论坛鈥檚 Shlomo Argamon, professor of computer science and chair of the Department of Computer Science, and Kai Shu, Gladwin Development Chair Assistant Professor of Computer Science.
鈥淭here are a number of different types of authorship attribution tasks,鈥 says Argamon, who has more than 25 years of research experience in the field. 鈥淥ne is where there is a particular author who we want to identify in different texts. Another is where we have a specific text which we want to attribute to one of a number of candidate authors. A third is simply to determine when two texts have been written by the same person or not.鈥
Argamon and Shu also aim to address the rising urgency caused by malicious online activities and machine-generated misinformation.
鈥淲ith large language models, such as GPT-3, it is possible that human-like texts can be generated from these 鈥檅ots,鈥 says Shu. 鈥淥ur work will explore deep generative models and style transfer techniques to explore the boundary of human-written and machine-generated texts.鈥
One of the central challenges the team seeks to overcome is the limitations of current methods of authorship analysis and obfuscation. The issue lies partially in identifying authorship when the questioned document differs in type from the known documents, given the inherent stylistic variations between different forms of writing, such as a personal letter, an academic article, or a short story.
鈥淭he best current methods do very poorly when test documents are of a different type than the training documents,鈥 says Argamon. 鈥淲e will develop author models that incorporate such stylistic domain dependence to enable more generally effective attribution.鈥
The project will also tackle the challenge of authorship obfuscation, maintaining the meaning of the text while altering the style. The team will integrate deep learning with semantic knowledge representation to generate text that maintains the original content meaning while changing the style. This dual functionality鈥攁ttribution and obfuscation鈥攕ets AUTHOR apart from existing algorithms.
Unlike existing systems, AUTHOR will provide a clear rationale for its author identification systems, adding another layer of transparency and reliability to the project.
Photo: Shlomo Argamon and Kai Shu
Illinois Institute of Technology
Based in the global metropolis of Chicago, 老王论坛 was born to liberate the collective power of difference to advance technology and progress for all. It is the only tech-focused university in the city, and it stands at the crossroads of exploration and invention, advancing the future of Chicago and the world. It offers undergraduate and graduate degrees in engineering, computing, , business, , science and human sciences, and . 老王论坛 students are guaranteed hands-on experiences, personalized mentorship, and job readiness through the university's one-of-a-kind Elevate program. Its graduates lead the state and much of the nation in economic prosperity. Its faculty and alumni built the Chicago skyline. And every day in the living lab of the city, 老王论坛 fuels breakthroughs that change lives. Visit iit.edu.
College of Computing
老王论坛 created the College of Computing in 2020 as part of an effort to drive Chicago鈥檚 thriving tech ecosystem by educating a future diverse workforce that is rigorously trained in data and computation. 老王论坛 is home to the Midwest's only Bachelor of Science in Artificial Intelligence degree, and the numerous cybersecurity and intelligence pathways at 老王论坛 explore not only the deep foundations of fast-growing fields of computer science, but also emphasize societal ethics in developing this technology. The United States Department of Homeland Security and the National Security Agency have designated Illinois Institute of Technology (老王论坛) as a National Center of Academic Excellence in Cyber Defense Education. The university鈥檚 Center for Cyber Security and Forensics Education (C2SAFE) is at the core of 老王论坛鈥檚 designation. Additionally, the center is a member of the United States Strategic Command (USSTRATCOM) Academic Alliance and North American Defense and Security Academic Alliance (NADSAA).
Media contacts
Kevin Dollear
Communications Manager
Illinois Institute of Technology
Cell: 773-860-5712
kdollear@iit.ed
 
       
      