Regular Expressions in Web Data Processing: a Case Study on Online Media

Author(s): Yavor Tabov
Subject(s): Social Sciences, Economy, Media studies, Business Economy / Management, Communication studies, Sociology, Social Informatics, ICT Information and Communications Technologies
Published by: Университет за национално и световно стопанство (УНСС)
Keywords: Regular expressions; Web data processing; Data cleaning; Normalization
Summary/Abstract: Regular expressions (regex) are a versatile tool for processing web data, offering precision and flexibility in handling unstructured or semi-structured content. In the context of online media, they play a vital role in tasks such as content extraction, data analysis, and workflow automation. This study examines the features and applications of regular expressions in data processing. It focuses on practical use cases of regex in web data processing for online media platforms, such as data extraction and the cleaning and normalization of raw text. Finally, the paper presents conclusions based on the research conducted.
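
As an illustration of the two use cases named in the abstract (data extraction, and cleaning and normalization of raw text), the following is a minimal Python sketch. It is not taken from the paper: the sample markup, the patterns, and the function names `clean_text` and `extract_article` are all assumptions made for this example. Regular expressions are used here only on a small, predictable fragment; they are not a general substitute for an HTML parser.

```python
import html
import re

# Hypothetical raw fragment from an online media page (not from the paper).
RAW = (
    '<h1 class="title">  Markets&nbsp;rally  </h1>'
    '<time datetime="2024-03-05">5 March</time>'
)

# Extraction patterns (assumed markup: one <h1> title, one <time datetime="...">).
TITLE_RE = re.compile(r"<h1[^>]*>(.*?)</h1>", re.IGNORECASE | re.DOTALL)
DATE_RE = re.compile(r'<time[^>]*\bdatetime="(\d{4}-\d{2}-\d{2})"')

# Cleaning patterns: strip tags, then collapse runs of whitespace.
TAG_RE = re.compile(r"<[^>]+>")
WS_RE = re.compile(r"\s+")


def clean_text(fragment):
    """Remove tags, decode HTML entities, and normalize whitespace."""
    text = TAG_RE.sub(" ", fragment)
    text = html.unescape(text)  # e.g. &amp; becomes &, &nbsp; becomes U+00A0
    return WS_RE.sub(" ", text).strip()


def extract_article(raw):
    """Extract a cleaned title and an ISO date, or None where absent."""
    title = TITLE_RE.search(raw)
    date = DATE_RE.search(raw)
    return {
        "title": clean_text(title.group(1)) if title else None,
        "date": date.group(1) if date else None,
    }


if __name__ == "__main__":
    print(extract_article(RAW))
```

The patterns are compiled once and reused, and each function returns `None` for a missing field rather than raising, which tends to suit batch processing of pages with inconsistent markup.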
