Publications

A list of selected papers in which research team members participated.
(For a full list see below or go to Google Scholar (Jisun An and Haewoon Kwak).

Modeling Political Activism around Gun Debate via Social Media

The United States have some of the highest rates of gun violence among developed countries. Yet, there is a disagreement about the extent to which firearms should be regulated. In this study, we employ social media signals to examine the predictors of offline political activism, at both population and individual level. We show that it is possible to classify the stance of users on the gun issue, especially accurately when network information is available. Alongside socioeconomic variables, network information such as the relative size of the two sides of the debate is also predictive of state-level gun policy. On individual level, we build a statistical model using network, content, and psycho-linguistic features that predicts real-life political action, and explore the most predictive linguistic features. Thus, we argue that, alongside demographics and socioeconomic indicators, social media provides useful signals in the holistic modeling of political engagement around the gun debate.

Yelena Mejova, Jisun An, Gianmarco De Francisci Morales, Haewoon Kwak

ACM Transactions on Social Computing, 2022

Storm the Capitol: Linking Offline Political Speech and Online Twitter Extra-Representational Participation on QAnon and the January 6 Insurrection

The transfer of power stemming from the 2020 presidential election occurred during an unprecedented period in United States history. Uncertainty from the COVID-19 pandemic, ongoing societal tensions, and a fragile economy increased societal polarization, exacerbated by the outgoing president’s offline rhetoric. As a result, online groups such as QAnon engaged in extra political participation beyond the traditional platforms. This research explores the link between offline political speech and online extra-representational participation by examining Twitter within the context of the January 6 insurrection. Using a mixed-methods approach of quantitative and qualitative thematic analyses, the study combines offline speech information with Twitter data during key speech addresses leading up to the date of the insurrection; exploring the link between Trump’s offline speeches and QAnon’s hashtags across a 3-day timeframe. We find that links between online extra-representational participation and offline political speech exist. This research illuminates this phenomenon and offers policy implications for the role of online messaging as a tool of political mobilization.

Claire Seungeun Lee, Juan Merizalde, John D. Colautti, Jisun An and Haewoon Kwak

Frontiers in Sociology, 2022

Press coverage-PsyPost

Measuring 9 Emotions of News Posts from 8 News Organizations across 4 Social Media Platforms for 8 Months

Using Plutchik’s wheel of emotions framework, we identify the emotional content of 133,487 social media posts and the audience’s emotional engagement expressed in 2,824,162 comments on those posts. We measure nine emotions (anger, anticipation, anxiety, disgust, joy, fear, sadness, surprise, trust) and two sentiments (positive and negative) using two extraction resources (EmoLex, LIWC) for eight major news outlets across four social media platforms (Facebook, Instagram, Twitter, and YouTube) during eight months. We then apply two approaches (Logistic Regression, Long Short-Term Memory) to predict emotional audience reactions before and after publishing the posts. …

Kholoud Khalil Aldous, Jisun An, Bernard J. Jansen

ACM Transactions on Social Computing, 2022

Understanding Toxicity Triggers on Reddit in the Context of Singapore

While the contagious nature of online toxicity sparked increasing interest in its early detection and prevention, most of the literature focuses on the Western world. In this work, we demonstrate that 1) it is possible to detect toxicity triggers in an Asian online community, and 2) toxicity triggers can be strikingly different between Western and Eastern contexts.

Yun Yu Chong, Haewoon Kwak

Proceedings of the 16th International AAAI Conference on Web and Social Media (ICWSM), 2022 (short)

Who Is Missing? Characterizing the Participation of Different Demographic Groups in a Korean Nationwide Daily Conversation Corpus

A conversation corpus is essential to build interactive AI applications. However, the demographic information of the participants in such corpora is largely underexplored mainly due to the lack of individual data in many corpora. In this work, we analyze a Korean nationwide daily conversation corpus constructed by the National Institute of Korean Language (NIKL) to characterize the participation of different demographic (age and sex) groups in the corpus.

Haewoon Kwak, Jisun An, Kunwoo Park

Proceedings of the 16th International AAAI Conference on Web and Social Media (ICWSM), 2022 (short)

What really matters?: characterising and predicting user engagement of news postings using multiple platforms, sentiments and topics

This research characterises user engagement of approximately 3,000,000 news postings of 53 news outlets and 50,000,000 associated user comments during 8 months on 5 social media platforms (i.e. Facebook, Instagram, Twitter, YouTube, and Reddit). We investigate the effect of sentiments and topics on user engagement across four levels of user engagement expressions (i.e. views, likes, comments, cross-platform posting). We find that sentiments and topics differ by both news outlets and social media platforms, and both sentiments and topics by the four levels of user engagement expression. …

Kholoud Khalil Aldous, Jisun An, Bernard J. Jansen

Behaviour & Information Technology, 2022

Predicting Anti-Asian Hateful Users on Twitter during COVID-19

We investigate predictors of anti-Asian hate among Twitter users throughout COVID-19. With the rise of xenophobia and polarization that has accompanied widespread social media usage in many nations, online hate has become a major social issue, attracting many researchers. Here, we apply natural language processing techniques to characterize social media users who began to post anti-Asian hate messages during COVID-19. We compare two user groups – those who posted anti-Asian slurs and those who did not – with respect to a rich set of features measured with data prior to COVID-19 and show that it is possible to predict who later publicly posted anti-Asian slurs. …

Jisun An, Haewoon Kwak, Claire Seungeun Lee, Bogang Jun, Yong-Yeol Ahn

Findings of the Association for Computational Linguistics EMNLP 2021

Code repo (github)

Precision Public Health Campaign: Delivering Persuasive Messages to Relevant Segments Through Targeted Advertisements on Social Media

We propose a novel precision public health campaign framework to structure and standardize the process of designing and delivering tailored health messages to target particular population segments using social media–targeted advertising tools. Our framework consists of five stages - defining a campaign goal, priority audience, and evaluation metrics; splitting the target audience into smaller segments; tailoring the message for each segment and conducting a pilot test; running the health campaign formally; and evaluating the performance of the campaigns. We have demonstrated how the framework works through 2 case studies. The precision public health campaign framework has the potential to support higher population uptake and engagement rates by encouraging a more standardized, concise, efficient, and targeted approach to public health campaign development.

Jisun An, Haewoon Kwak, Hanya M Qureshi, Ingmar Weber

JMIR Form Res 2021;5(9):e22313, 2021

FrameAxis: characterizing microframe bias and intensity with word embedding

Framing is a process of emphasizing a certain aspect of an issue over the others, nudging readers or listeners towards different positions on the issue even without making a biased argument. Here, we propose FrameAxis, a method for characterizing documents by identifying the most relevant semantic axes (“microframes”) that are overrepresented in the text using word embedding. Our unsupervised approach can be readily applied to large datasets because it does not require manual annotations. …

Haewoon Kwak, Jisun An, Elise Jing, Yong-Yeol Ahn

PeerJ Computer Science 7:e644, 2021

Code repo (github)

Populist Supporters on Reddit: A Comparison of Content and Behavioral Patterns Within Publics of Supporters of Donald Trump and Hillary Clinton

We investigate differences along these dimensions on the online forum Reddit by comparing linguistic patterns and content of comments in two subreddits focusing on a populist, Donald Trump (/r/The_Donald), and a center-left politician, Hillary Clinton (/r/hillaryclinton), during the 2016 U.S. presidential election campaign.

Andreas Jungherr, Oliver Posegga, Jisun An

Social Science Computer Review. March 2021.

How-to Present News on Social Media: A Causal Analysis of Editing News Headlines for Boosting User Engagement

We first build a parallel corpus of original news articles and their corresponding tweets that were shared by eight media outlets. Then, we explore how those media edited tweets against original headlines, and the effects would be..

Kunwoo Park, Haewoon Kwak, Jisun An, Sanjay Chawla

Proceedings of the 15th International AAAI Conference on Web and Social Media (ICWSM), 2021

A Systematic Media Frame Analysis of 1.5 Million New York Times Articles from 2000 to 2017

Framing is an indispensable narrative device for news media because even the same facts may lead to conflicting understandings if deliberate framing is employed. By developing a media frame classifier that achieves state-of-the-art performance, we systematically analyze the media frames of 1.5 million New York Times articles published from 2000 to 2017.

Haewoon Kwak, Jisun An, Yong-Yeol Ahn

Proceedings of the 12th ACM Conference on Web Science (WebSci), 2020

Identifying and Characterizing Alternative News Media on Facebook

In this work, we propose a graph-based semi-supervised method to measure the political bias of pages on most countries and show the political split of the alternative media, mainstream media, and public figures pages. We validate our method using the publicly available U.S. dataset and then apply it to Brazilian pages, where we found a larger number of right-wing pages in general, except for alternative news media.

Samuel S Guimarães, Julio CS Reis, Lucas Lima, Filipe N Ribeiro, Marisa Vasconcelos, Jisun An, Haewoon Kwak, Fabrício Benevenuto

IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), 2020

What Was Written vs. Who Read It: News Media Profiling Using Text Analysis and Social Media Context

A first paper from the TBG collaboration is online, with beautiful data from the Baumberger group in Geneva, and great devices from the Efetov group at ICFO. We measured STM on the device.

Ramy Baly, Georgi Karadzhov, Jisun An, Haewoon Kwak, Yoan Dinkov, Ahmed Ali, James Glass, Preslav Nakov

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL) (2020)

Empirical Evaluation of Three Common Assumptions in Building Political Media Bias Datasets

We empirically validate three common assumptions in building political media bias datasets, which are (i) labelers’ political leanings do not affect labeling tasks, (ii) news articles follow their source outlet’s political leaning, and (iii) political leaning of a news outlet is stable across different topics.

Soumen Ganguly, Juhi Kulshrestha, Jisun An, Haewoon Kwak

Proceedings of the 14th International AAAI Conference on Web and Social Media (ICWSM), 2020

“Trust Me, I Have a Ph.D.”: A Propensity Score Analysis on the Halo Effect of Disclosing One's Offline Social Status in Online Communities

We study two Reddit communities that adopted this scheme, whereby posts include tags identifying education status referred to as flairs, and we examine how the “transferred” social status affects the interactions among the users.

Kunwoo Park, Haewoon Kwak, Hyunho Song, Meeyoung Cha

Proceedings of the 14th International AAAI Conference on Web and Social Media (ICWSM), 2020

Are These Comments Triggering? Predicting Triggers of Toxicity in Online Discussions

We define toxicity triggers in online discussions as a non-toxic comment that lead to toxic replies. Then, we build a neural network-based prediction model for toxicity trigger.

Hind Almerekhi, Haewoon Kwak, Bernard Jim Jansen, Joni Salminen (short)

Proceedings of The Web Conference (WWW), 2020

Going beyond accuracy: estimating homophily in social networks using predictions

We show that estimating homophily in a network can be viewed as a dyadic prediction problem, and that homophily estimates are unbiased when dyad-level residuals sum to zero in the network. Then, we propose a novel “ego-alter” modeling approach that outperforms standard node and dyad classification strategies.

George Berry, Antonio Sirianni, Ingmar Weber, Jisun An, Michael Macy (preprint)

arXiv preprint arXiv:2001.11171, 2020

Tanbih: Get To Know What You Are Reading

We introduce Tanbih, a news aggregator with intelligent analysis tools to help readers understanding what’s behind a news story. Our system displays news grouped into events and generates media profiles that show the general factuality of reporting, the degree of propagandistic content, hyper-partisanship, leading political ideology, general frame of reporting, and stance with respect to various claims and topics of a news outlet.

Yifan Zhang, Giovanni Da San Martino, Alberto Barrón-Cedeño, Salvatore Romeo, Jisun An, Haewoon Kwak, Todor Staykovski, Israa Jaradat, Georgi Karadzhov, Ramy Baly, Kareem Darwish, James Glass, Preslav Nakov (demo)

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), 2019

Gender and Racial Diversity in Commercial Brands' Advertising Images on Social Media

Gender and racial diversity in the mediated images from the media shape our perception of different demographic groups. In this work, we investigate gender and racial diversity of 85,957 advertising images shared by the 73 top international brands on Instagram and Facebook.

Jisun An, Haewoon Kwak

Proceedings of Social Informatics (SocInfo), 2019

Best Paper Award

Political Discussions in Homogeneous and Cross-Cutting Communication Spaces

We use Reddit to explore the nature of political discussionsin homogeneous and cross-cutting communication spaces. Inparticular, we develop an analytical template to studyinter-actionandlinguistic patternswithin and between politicallyhomogeneous and heterogeneous communication spaces. Ouranalyses reveal different behavioral patterns in homogeneousand cross-cutting communications spaces.

Jisun An, Haewoon Kwak, Oliver Posegga, Andreas Jungherr

Proceedings of the 13th International AAAI Conference on Web and Social Media (ICWSM), 2019

View, Like, Comment, Post: Analyzing User Engagement by Topic at 4 Levels across 5 Social Media Platforms for 53 News Organizations

We evaluate the effects of the topics of social media posts on audiences across five social media platforms (i.e., Facebook, Instagram, Twitter, YouTube, and Reddit) at four levels of user engagement. We collected 3,163,373 social posts from 53 news organizations across five platforms during an 8month period.

Kholoud Khalil Aldous, Jisun An, Bernard J. Jansen

Proceedings of the 13th International AAAI Conference on Web and Social Media (ICWSM), 2019

Discursive Power in Contemporary Media Systems: A Comparative Framework

We propose the concept of discursive power. This describes the ability of contributors to communication spaces to introduce, amplify, and maintain topics, frames, and speakers, thus shaping public discourses and controversies that unfold in interconnected communication spaces.

Andreas Jungherr, Oliver Posegga, Jisun An

The International Journal of Press/Politics, 24(4), 2019

Imaginary People Representing Real Numbers: Generating Personas from Online Social Media Data

We develop a methodology to automate creating imaginary people, referred to as personas, by processing complex behavioral and demographic data of social media audiences. From a popular social media account containing more than 30 million interactions by viewers from 198 countries engaging with more than 4,200 online videos produced by a global media corporation, we demonstrate that our methodology has several novel accomplishments.

Jisun An, Haewoon Kwak, Soon-gyo Jung, Joni Salminen, M. Admad, Bernard J. Jansen

ACM Transactions on the Web, 12(4), 2018

Anatomy of Online Hate: Developing a Taxonomy and Machine Learning Models for Identifying and Classifying Hate in Online News Media

We manually label 5,143 hateful expressions posted to YouTube and Facebook videos among a dataset of 137,098 comments from an online news media. We then create a granular taxonomy of different types and targets of online hate and train machine learning models to automatically detect and classify the hateful comments in the full dataset.

Joni Salminen, Hind Almerekhi, Milica Milenković, Soon-gyo Jung, Jisun An, Haewoon Kwak, Bernard J. Jansen

Proceedings of the 12th International AAAI Conference on Web and Social Media (ICWSM), 2018

Identifying Regional Trends in Avatar Customization

We manually label 5,143 hateful expressions posted to YouTube and Facebook videos among a dataset of 137,098 comments from an online news media. We then create a granular taxonomy of different types and targets of online hate and train machine learning models to automatically detect and classify the hateful comments in the full dataset.

Peter Mawhorter, Sercan Şengün, Haewoon Kwak, D. Fox Harrell

IEEE Transactions on Games, 10(2), 2018

SemAxis: A Lightweight Framework to Characterize Domain-Specific Word Semantics Beyond Sentiment

We propose SemAxis, a simple yet powerful framework to characterize word semantics using many semantic axes in word-vector spaces beyond sentiment. We demonstrate that SemAxis can capture nuanced semantic representations in multiple online communities. We also show that, when the sentiment axis is examined, SemAxis outperforms the state-of-the-art approaches in building domain-specific sentiment lexicons.

Jisun An, Haewoon Kwak, Yong-Yeol Ahn

Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL), 2018

What We Read, What We Search: Media Attention and Public Attention among 193 Countries

We investigate the alignment of international attention of news media organizations within 193 countries with the expressed international interests of the public within those same countries from March 7, 2016 to April 14, 2017. We collect fourteen months of longitudinal data of online news from Unfiltered News and web search volume data from Google Trends and build a multiplex network of media attention and public attention in order to study its structural and dynamic properties.

Haewoon Kwak, Jisun An, Joni Salminen, Soon-Gyo Jung, Bernard J. Jansen.

Proceedings of the 2018 World Wide Web Conference (WWW), 2018

What is Gab? A Bastion of Free Speech or an Alt-Right Echo Chamber?

We provide, to the best of our knowledge, the first characterization of Gab. We collect and analyze 22M posts produced by 336K users between August 2016 and January 2018, finding that Gab is predominantly used for the dissemination and discussion of news and world events, and that it attracts alt-right users, conspiracy theorists, and other trolls

Savvas Zannettou, Barry Bradlyn, Emiliano De Cristofaro, Haewoon Kwak, Michael Sirivianos, Gianluca Stringhini, Jeremy Blackburn

Companion Proceedings of the The Web Conference (WWW), 2018

Press coverage-New Scientist, and Vice

“Is More Better?”: Impact of Multiple Photos on Perception of Persona Profiles

We investigate if and how more photos than a single headshot can heighten the level of information provided by persona profiles. We conduct eye-tracking experiments and qualitative interviews with variations in the photos-a single headshot, a headshot and images of the persona in different contexts, and a headshot with pictures of different people representing key persona attributes.

Joni Salminen, Lene Nielsen, Soon-Gyo Jung, Jisun An, Haewoon Kwak, Bernard J. Jansen

Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems (CHI), 2018

Data-driven Approach to Measuring the Level of Press Freedom Using Media Attention Diversity from Unfiltered News

Published by Reporters Without Borders every year, the Press Freedom Index (PFI) reflects the fear and tension in the newsroom pushed by the government and private sectors. While the PFI is invaluable in monitoring media environments worldwide, the current survey-based method has inherent limitations to updates in terms of cost and time. In this work, we introduce an alternative way to measure the level of press freedom using media attention diversity compiled from Unfiltered News.

Jisun An, Haewoon Kwak

Proceedings of the ICWSM Workshop on NEws and publiC Opinion (NECO), 2017

Picked as The Best of the Physics arXiv (week ending April 15, 2017) in MIT Technology Review

I Would Not Plant Apple Trees If the World Will Be Wiped: Analyzing Hundreds of Millions of Behavioral Records of Players During an MMORPG Beta Test

We use player behavior during the closed beta test of the MMORPG ArcheAge as a proxy for an extreme situation-at the end of the closed beta test, all user data is deleted, and thus, the outcome (or penalty) of players’ in-game behaviors in the last few days loses its meaning. We analyzed 270 million records of player behavior in the 4th closed beta test of ArcheAge.

Ah Reum Kang, Jeremy Blackburn, Haewoon Kwak, Huy Kang Kim

Proceedings of the 26th International Conference on World Wide Web (WWW) Companion, 2017

Press coverage-New Scientist, IFL Science, PC Gamer, Massively OK, El Confidencial, Joongang Ilbo, and so on.

Are You Charlie or Ahmed? Cultural Pluralism in Charlie Hebdo Response on Twitter

We study the response to the Charlie Hebdo shootings of January 7, 2015 on Twitter across the globe. We ask whether the stances on the issue of freedom of speech can be modeled using established sociological theories, including Huntington’s culturalist Clash of Civilizations, and those taking into consideration social context, including Density and Interdependence theories. We find support for Huntington’s culturalist explanation, in that the established traditions and norms of one’s “civilization” predetermine some of one’s opinion.

Jisun An, Haewoon Kwak, Yelena Mejova, Sonia Alonso Saenz De Oger, Braulio Gomez Fortes

Proceeding of the 10th International Conference on Web and Social Media (ICWSM), 2016

Searching for a Unique Style in Soccer

We introduce the the concept of “flow motifs” to characterize the statistically significant pass sequence patterns. It extends the idea of the network motifs, highly significant subgraphs that usually consists of three or four nodes. The analysis of the motifs in the pass networks allows us to compare and differentiate the styles of different teams. Although most teams tend to apply homogenous style, surprisingly, a unique strategy of soccer exists. Specifically, FC Barcelona’s famous tiki-taka does not consist of uncountable random passes but rather has a precise, finely constructed structure.

Laszlo Gyarmati, Haewoon Kwak, Pablo Rodriguez

KDD Workshop on Large-Scale Sports Analytics, 2014

Press coverage-BBC, MIT Technology Review, The Times, The Economist, Slate, Pacific Standard, and so on.

STFU NOOB! Predicting Crowdsourced Decisions on Toxic Behavior in Online Games

We propose a supervised learning approach for predicting crowdsourced decisions on toxic behavior with large-scale labeled data collections; over 10 million user reports involved in 1.46 million toxic players and corresponding crowdsourced decisions. Our result shows good performance in detecting overwhelmingly majority cases and predicting crowdsourced decisions on them. We demonstrate good portability of our classifier across regions.

Jeremy Blackburn, Haewoon Kwak

Proceedings of the 23rd international conference on World wide web (WWW), 2014

Press coverage-Nature, Scientific American, Chosun Ilbo

100+ papers citing this work (Google scholar)

Recommending investors for crowdfunding projects

One of the most popular crowdfunding sites is Kickstarter. In it, creators post descriptions of their projects and advertise them on social media sites (mainly Twitter), while investors look for projects to support. We set out to propose different ways of recommending investors found on Twitter for specific Kickstarter projects. We do so by conducting hypothesis-driven analyses of pledging behavior and translate the corresponding findings into different recommendation strategies. The best strategy achieves, on average, 84% of accuracy in predicting a list of potential investors’ Twitter accounts for any given project.

Jisun An, Daniele Quercia, Jon Crowcroft

Proceedings of the 23rd international conference on World wide web (WWW), 2014

Press coverage-FastCompany

What is Twitter, a Social Network or a News Media?

We have crawled the entire Twitter site and obtained 41.7 million user profiles, 1.47 billion social relations, 4,262 trending topics, and 106 million tweets. In its follower-following topology analysis we have found a non-power-law follower distribution, a short effective diameter, and low reciprocity, which all mark a deviation from known characteristics of human social networks [28]. In order to identify influentials on Twitter, we have ranked users by the number of followers and by PageRank and found two rankings to be similar. Ranking by retweets differs from the previous two rankings, indicating a gap in influence inferred from the number of followers and that from the popularity of one’s tweets.

Haewoon Kwak, Changhyun Lee, Hosung Park, Sue Moon

Proceedings of the 19th international conference on World wide web (WWW), 2010.

Press coverage - Mashable Op-Ed, ReadWrite, The Guardian, PC News, Chosun Ilbo, DongA Ilbo

8000+ papers citing this work (Google scholar)

 

Full List

Modeling Political Activism around Gun Debate via Social Media
Yelena Mejova, Jisun An, Gianmarco De Francisci Morales, Haewoon Kwak
ACM Transactions on Social Computing, 2022

Storm the Capitol: Linking Offline Political Speech and Online Twitter Extra-Representational Participation on QAnon and the January 6 Insurrection
Claire Seungeun Lee, Juan Merizalde, John D. Colautti, Jisun An and Haewoon Kwak
Frontiers in Sociology, 2022
Press coverage-PsyPost

Measuring 9 Emotions of News Posts from 8 News Organizations across 4 Social Media Platforms for 8 Months
Kholoud Khalil Aldous, Jisun An, Bernard J. Jansen
ACM Transactions on Social Computing, 2022

Understanding Toxicity Triggers on Reddit in the Context of Singapore
Yun Yu Chong, Haewoon Kwak
Proceedings of the 16th International AAAI Conference on Web and Social Media (ICWSM), 2022 (short)

Who Is Missing? Characterizing the Participation of Different Demographic Groups in a Korean Nationwide Daily Conversation Corpus
Haewoon Kwak, Jisun An, Kunwoo Park
Proceedings of the 16th International AAAI Conference on Web and Social Media (ICWSM), 2022 (short)

What really matters?: characterising and predicting user engagement of news postings using multiple platforms, sentiments and topics
Kholoud Khalil Aldous, Jisun An, Bernard J. Jansen
Behaviour & Information Technology, 2022

Predicting Anti-Asian Hateful Users on Twitter during COVID-19
Jisun An, Haewoon Kwak, Claire Seungeun Lee, Bogang Jun, Yong-Yeol Ahn
Findings of the Association for Computational Linguistics EMNLP 2021

Code repo (github)

Precision Public Health Campaign: Delivering Persuasive Messages to Relevant Segments Through Targeted Advertisements on Social Media
Jisun An, Haewoon Kwak, Hanya M Qureshi, Ingmar Weber
JMIR Form Res 2021;5(9):e22313, 2021

FrameAxis: characterizing microframe bias and intensity with word embedding
Haewoon Kwak, Jisun An, Elise Jing, Yong-Yeol Ahn
PeerJ Computer Science 7:e644, 2021

Code repo (github)

Populist Supporters on Reddit: A Comparison of Content and Behavioral Patterns Within Publics of Supporters of Donald Trump and Hillary Clinton
Andreas Jungherr, Oliver Posegga, Jisun An
Social Science Computer Review. March 2021.

How-to Present News on Social Media: A Causal Analysis of Editing News Headlines for Boosting User Engagement
Kunwoo Park, Haewoon Kwak, Jisun An, Sanjay Chawla
Proceedings of the 15th International AAAI Conference on Web and Social Media (ICWSM), 2021

A Systematic Media Frame Analysis of 1.5 Million New York Times Articles from 2000 to 2017
Haewoon Kwak, Jisun An, Yong-Yeol Ahn
Proceedings of the 12th ACM Conference on Web Science (WebSci), 2020

Identifying and Characterizing Alternative News Media on Facebook
Samuel S Guimarães, Julio CS Reis, Lucas Lima, Filipe N Ribeiro, Marisa Vasconcelos, Jisun An, Haewoon Kwak, Fabrício Benevenuto
IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), 2020

What Was Written vs. Who Read It: News Media Profiling Using Text Analysis and Social Media Context
Ramy Baly, Georgi Karadzhov, Jisun An, Haewoon Kwak, Yoan Dinkov, Ahmed Ali, James Glass, Preslav Nakov
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL) (2020)

Empirical Evaluation of Three Common Assumptions in Building Political Media Bias Datasets
Soumen Ganguly, Juhi Kulshrestha, Jisun An, Haewoon Kwak
Proceedings of the 14th International AAAI Conference on Web and Social Media (ICWSM), 2020

“Trust Me, I Have a Ph.D.”: A Propensity Score Analysis on the Halo Effect of Disclosing One’s Offline Social Status in Online Communities
Kunwoo Park, Haewoon Kwak, Hyunho Song, Meeyoung Cha
Proceedings of the 14th International AAAI Conference on Web and Social Media (ICWSM), 2020

Are These Comments Triggering? Predicting Triggers of Toxicity in Online Discussions
Hind Almerekhi, Haewoon Kwak, Bernard Jim Jansen, Joni Salminen (short)
Proceedings of The Web Conference (WWW), 2020

Going beyond accuracy: estimating homophily in social networks using predictions
George Berry, Antonio Sirianni, Ingmar Weber, Jisun An, Michael Macy (preprint)
arXiv preprint arXiv:2001.11171, 2020

Persona Perception Scale: Development and Exploratory Validation of an Instrument for Evaluating Individuals’ Perceptions of Personas
Joni Salminen, Joao M. Santos, Haewoon Kwak, Jisun An, Soon-gyo Jung, Bernard J. Jansen
International Journal of Human-Computer Studies, 2020

Tanbih: Get To Know What You Are Reading
Yifan Zhang, Giovanni Da San Martino, Alberto Barrón-Cedeño, Salvatore Romeo, Jisun An, Haewoon Kwak, Todor Staykovski, Israa Jaradat, Georgi Karadzhov, Ramy Baly, Kareem Darwish, James Glass, Preslav Nakov (demo)
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), 2019

Gender and Racial Diversity in Commercial Brands’ Advertising Images on Social Media
Jisun An, Haewoon Kwak
Proceedings of Social Informatics (SocInfo), 2019
Best Paper Award

Stylistic Features Usage: Similarities and Differences Using Multiple Social Networks
Kholoud Khalil Aldous, Jisun An, Bernard J. Jansen
Proceedings of Social Informatics (SocInfo), 2019

Predicting Audience Engagement Across Social Media Platforms in the News Domain
Kholoud Khalil Aldous, Jisun An, Bernard J. Jansen
Proceedings of Social Informatics (SocInfo), 2019

Detecting Toxicity Triggers in Online Discussions
Hind Almerekhi, Haewoon Kwak, Bernard Jim Jansen, Joni Salminen (poster)
Proceedings of the 30th ACM Conference on Hypertext and Social Media (HT), 2019

Political Discussions in Homogeneous and Cross-Cutting Communication Spaces
Jisun An, Haewoon Kwak, Oliver Posegga, Andreas Jungherr
Proceedings of the 13th International AAAI Conference on Web and Social Media (ICWSM), 2019

View, Like, Comment, Post: Analyzing User Engagement by Topic at 4 Levels across 5 Social Media Platforms for 53 News Organizations
Kholoud Khalil Aldous, Jisun An, Bernard J. Jansen
Proceedings of the 13th International AAAI Conference on Web and Social Media (ICWSM), 2019

The Challenges of Creating Engaging Content: Results from a Focus Group Study of a Popular News Media Organization
Kholoud Khalil Aldous, Jisun An, Bernard J. Jansen (Extended Abstracts)
Extended Abstracts of the 2019 CHI Conference on Human Factors in Computing Systems (CHI), 2019

Social media mining for journalism
Arkaitz Zubiaga, Bahareh Heravi, Jisun An, Haewoon Kwak (Guest editorial)
Online Information Review, 2019

Discursive Power in Contemporary Media Systems: A Comparative Framework
Andreas Jungherr, Oliver Posegga, Jisun An
The International Journal of Press/Politics, 24(4), 2019

Reports of the Workshops Held at the 2018 International AAAI Conference on Web and Social Media
Jisun An, Rumi Chunara, David J. Crandall, Darian Frajberg, Megan French, Bernard J. Jansen, Juhi Kulshrestha, Yelena Mejova, Daniel M. Romero, Joni Salminen, Amit Sharma, Amit Sheth, Chenhao Tan, Samuel Hardman Taylor, Sanjaya Wijeratne
AI Magazine, 2018

Imaginary People Representing Real Numbers: Generating Personas from Online Social Media Data
Jisun An, Haewoon Kwak, Soon-gyo Jung, Joni Salminen, M. Admad, Bernard J. Jansen
ACM Transactions on the Web, 12(4), 2018

Assessing the Accuracy of Four Popular Face Recognition Tools for Inferring Gender, Age, and Race
Soon-gyo Jung, Jisun An, Haewoon Kwak, Joni Salminen, Bernard Jim Jansen (short)
Proceedings of the 12th International AAAI Conference on Web and Social Media (ICWSM), 2018

Anatomy of Online Hate: Developing a Taxonomy and Machine Learning Models for Identifying and Classifying Hate in Online News Media
Joni Salminen, Hind Almerekhi, Milica Milenković, Soon-gyo Jung, Jisun An, Haewoon Kwak, Bernard J. Jansen
Proceedings of the 12th International AAAI Conference on Web and Social Media (ICWSM), 2018

Identifying Regional Trends in Avatar Customization
Peter Mawhorter, Sercan Şengün, Haewoon Kwak, D. Fox Harrell
IEEE Transactions on Games, 10(2), 2018

SemAxis: A Lightweight Framework to Characterize Domain-Specific Word Semantics Beyond Sentiment
Jisun An, Haewoon Kwak, Yong-Yeol Ahn
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL), 2018

What We Read, What We Search: Media Attention and Public Attention among 193 Countries
Haewoon Kwak, Jisun An, Joni Salminen, Soon-Gyo Jung, Bernard J. Jansen.
Proceedings of the 2018 World Wide Web Conference (WWW), 2018

What is Gab? A Bastion of Free Speech or an Alt-Right Echo Chamber?
Savvas Zannettou, Barry Bradlyn, Emiliano De Cristofaro, Haewoon Kwak, Michael Sirivianos, Gianluca Stringhini, Jeremy Blackburn
Companion Proceedings of the The Web Conference (WWW), 2018
Press coverage-New Scientist, and Vice

Fixation and Confusion: Investigating Eye-tracking Participants’ Exposure to Information in Personas
Joni Salminen, Bernard J. Jansen, Jisun An, Soon-Gyo Jung, Lene Nielsen, Haewoon Kwak
Proceedings of the 2018 Conference on Human Information Interaction & Retrieval (CHIIR), 2018

“Is More Better?”: Impact of Multiple Photos on Perception of Persona Profiles
Joni Salminen, Lene Nielsen, Soon-Gyo Jung, Jisun An, Haewoon Kwak, Bernard J. Jansen
Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems (CHI), 2018

Reports of the Workshops Held at the 2017 International AAAI Conference on Web and Social Media
Jisun An, Giovanni Luca Ciampaglia, Nir Grinberg, Kenneth Joseph, Alexios Mantzarlis, Gregory Maus, Filippo Menczer, Nicholas Proferes, Brooke Foucault Welles
AI Magazine, 2017

Convergence of Media Attention Across 129 Countries
Jisun An, Hassan Aldarbesti, Haewoon Kwak
Proceedings of Social Informatics (SocInfo), 2017

Multidimensional Analysis of the News Consumption of Different Demographic Groups on a Nationwide Scale
Jisun An, Haewoon Kwak
Proceedings of Social Informatics (SocInfo), 2017

Multiplex Media Attention and Disregard Network among 129 Countries
Haewoon Kwak, Jisun An
Proceedings of the IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), 2017

Demographics of News Sharing in Twitter
Julio Reis, Haewoon Kwak, and Jisun An, Johnnatan Messias, Fabrıcio Benevenuto
Proceedings of the 28th ACM Conference on Hypertext and Social Media (HT), 2017

Data-driven Approach to Measuring the Level of Press Freedom Using Media Attention Diversity from Unfiltered News
Jisun An, Haewoon Kwak
Proceedings of the ICWSM Workshop on NEws and publiC Opinion (NECO), 2017
Picked as The Best of the Physics arXiv (week ending April 15, 2017) in MIT Technology Review

What Gets Media Attention and How Media Attention Evolves Over Time - Large-scale Empirical Evidence from 196 Countries
Jisun An, Haewoon Kwak (short)
Proceedings of the 11th International AAAI Conference on Web and Social Media (ICWSM), 2017

Persona Generation from Aggregated Social Media Data
Soon-Gyo Jung, Jisun An, Haewoon Kwak, Moeed Ahmad, Lene Nielsen, Bernard J. Jansen (Extended Abstract)
Proceedings of the 2017 CHI Conference Extended Abstracts on Human Factors in Computing Systems (CHI), 2017

I Would Not Plant Apple Trees If the World Will Be Wiped: Analyzing Hundreds of Millions of Behavioral Records of Players During an MMORPG Beta Test
Ah Reum Kang, Jeremy Blackburn, Haewoon Kwak, Huy Kang Kim
Proceedings of the 26th International Conference on World Wide Web (WWW) Companion, 2017
Press coverage-New Scientist, IFL Science, PC Gamer, Massively OK, El Confidencial, Joongang Ilbo, and so on.

Achievement and Friends: Key Factors of Player Retention Vary Across Player Levels in Online Multiplayer Games
Kunwoo Park, Meeyoung Cha, Haewoon Kwak, Kuan-Ta Chen
Proceedings of the 26th International Conference on World Wide Web (WWW) Companion, 2017

Culturally-Grounded Analysis of Everyday Creativity in Social Media: A Case Study in Qatari Context
D. Fox Harrell, Sarah Vieweg, Haewoon Kwak, Chong-U Lim, Sercan Sengun, Ali Jahanian, Pablo Ortiz
Proceedings of the 2017 ACM SIGCHI Conference on Creativity and Cognition (C&C), 2017

Who Are Your Users? Comparing Media Professionals’ Preconception of Users to Data-Driven Personas
Lene Nielsen, Soon-Gyo Jung, Jisun An, Joni Salminen, Haewoon Kwak, Bernard J. Jansen
Proceedings of the 29th Australian Conference on Computer-Human Interaction (OZCHI), 2017

Generating Cultural Personas from Social Data: A Perspective of Middle Eastern Users
J. Salminen, S. Sengün, H. Kwak, B. Jansen, J. An, S. Jung, S. Vieweg, D. F. Harrell
Proceedings of the 5th International Conference on Future Internet of Things and Cloud Workshops, 2017

Personas for Content Creators via Decomposed Aggregate Audience Statistics
Jisun An, Haewoon Kwak, Bernard J. Jansen (short)
Proceedings of the 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), 2017

High correlation of Middle East respiratory syndrome spread with Google search and Twitter trends in Korea
Soo-Yong Shin, Dong-Woo Seo, Jisun An, Haewoon Kwak, Sung-Han Kim, Jin Gwack, Min-Woo Jo
Scientific Reports 6, Article number 32920 (2016)

Multidimensional Analysis of Gender and Age Differences in News Consumption
Jisun An, Haewoon Kwak
Computation+Journalism (C+J) Symposium (2016)

Revealing the Hidden Patterns of News Photos: Analysis of Millions of News Photos Using GDELT and Deep Learning-based Vision APIs
Haewoon Kwak, Jisun An
ICWSM Workshop on NEws and publiC Opinion (NECO), 2016
Picked as The Best of the Physics arXiv (week ending March 26, 2016) in MIT Technology Review

Two Tales of the World: Comparison of Widely Used World News Datasets: GDELT and EventRegistry
Haewoon Kwak, Jisun An (short)
Proceeding of the 10th International Conference on Web and Social Media (ICWSM), 2016

Are You Charlie or Ahmed? Cultural Pluralism in Charlie Hebdo Response on Twitter
Jisun An, Haewoon Kwak, Yelena Mejova, Sonia Alonso Saenz De Oger, Braulio Gomez Fortes
Proceeding of the 10th International Conference on Web and Social Media (ICWSM), 2016

#greysanatomy vs. #yankees: Demographics and Hashtag Use on Twitter.
Jisun An, Ingmar Weber (short)
Proceeding of the 10th International Conference on Web and Social Media (ICWSM), 2016

Whom should we sense in ‘social sensing’-analyzing which users work best for social media now-casting
Jisun An, Ingmar Weber
EPJ Data Science, 4, Article number 22, 2015

Consumers and Suppliers: Attention asymmetries. A Case Study of Aljazeera’s News Coverage and Comments
Sofiane Abbar, Jisun An, Haewoon Kwak, Yacine Messaoui, Javier Borge-Holthoefer
Computation+Journalsim (C+J) Symposium, 2015

Breaking the News: First Impressions Matter on Online News
Julio Reis, Fabrıcio Benevenuto, Pedro Olmo, Raquel Prates, Haewoon Kwak, Jisun An
Proceeding of the 9th International Conference on Web and Social Media (ICWSM), 2015
Picked as Other Interesting arXiv Papers (Week ending April 11, 2015) in MIT Technology Review, and O Globo
100+ papers citing this work (Google scholar)

Exploring Cyberbullying and Other Toxic Behavior in Team Competition Online Games
Haewoon Kwak, Jeremy Blackburn, Seungyeop Han
Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems (CHI), 2015
100+ papers citing this work (Google scholar)

From Cells to Streets: Estimating Mobile Paths with Cellular-Side Data
Ilias Leontiadis, Antonio Lima, Haewoon Kwak, Rade Stanojevic, David Wetherall, Konstantina Papagiannaki
Proceedings of the 10th ACM International on Conference on emerging Networking Experiments and Technologies (CoNEXT), 2014

Understanding News Geography and Major Determinants of Global News Coverage of Disasters
Haewoon Kwak, Jisun An (extension of SocInfo’14)
Computation+Journalism (C+J) Symposium, 2014

A First Look at Global News Coverage of Disasters By Using the GDELT Dataset
Haewoon Kwak, Jisun An
Proceedings of Social Informatics, 2014
Press Coverage-MIT Technology Review, ACM TechNews

Linguistic Analysis of Toxic Behavior in an Online Video Game
Haewoon Kwak, Jeremy Blackburn
SocInfo Workshop on Exploration on Games and Gamers (EGG), 2014

Searching for a Unique Style in Soccer
Laszlo Gyarmati, Haewoon Kwak, Pablo Rodriguez
KDD Workshop on Large-Scale Sports Analytics, 2014
Press coverage-BBC, MIT Technology Review, The Times, The Economist, Slate, Pacific Standard, and so on.

Didn’t You See My Message? Predicting Attentiveness to Mobile Instant Messages
Martin Pielot, Rodrigo de Oliveira, Haewoon Kwak, Nuria Oliver
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI), 2014
150+ papers citing this work (Google scholar)

STFU NOOB! Predicting Crowdsourced Decisions on Toxic Behavior in Online Games
Jeremy Blackburn, Haewoon Kwak
Proceedings of the 23rd international conference on World wide web (WWW), 2014
Press coverage-Nature, Scientific American, Chosun Ilbo
100+ papers citing this work (Google scholar)

Has Much Potential but Biased: Exploring the Scholarly Landscape in Twitter
Haewoon Kwak, Jonggun Lee (poster)
Proceedings of the 23rd International Conference on World Wide Web Companion, 2014

Sharing political news: the balancing act of intimacy and socialization in selective exposure
Jisun An, Daniele Quercia, Meeyoung Cha, Krishna Gummadi, Jon Crowcroft
EPJ Data Science volume 3, Article number 12, 2014

Recommending investors for crowdfunding projects
Jisun An, Daniele Quercia, Jon Crowcroft
Proceedings of the 23rd international conference on World wide web (WWW), 2014
Press coverage-FastCompany

Partisan Sharing: Facebook Evidence and Societal Consequences
Jisun An, Daniele Quercia, Jon Crowcroft
Proceedings of the Second ACM Conference on Online Social Networks (COSN), 2014

Tower of Babel: A Crowdsourcing Game Building Sentiment Lexicons for Resource-scarce Languages
Yoonsung Hong, Haewoon Kwak, Youngmin Baek, Sue Moon
WWW Workshop on Multidisciplinary Approaches to Big Social Data Analysis, 2013

Structures of Broken Ties: Exploring Unfollow Behavior on Twitter
Bo Xu, Yun Huang, Haewoon Kwak, Noshir S. Contractor
Proceedings of the 16th ACM Conference on Computer Supported Cooperative Work and Social Computing (CSCW), 2013

Why Individuals Seek Diverse Opinions (or Why They Don’t)
Jisun An, Daniele Quercia, Jon Crowcroft
Proceedings of the 5th Annual ACM Web Science Conference (WebSci), 2013

Why Do I Retweet It? An Information Propagation Model for Microblogs
Fabio Pezzoni, Jisun An, Andrea Passarella, Jon Crowcroft, Marco Conti
Proceedings of the 5th International Conference on Social Informatics (SocInfo), 2013

Traditional Media Seen from Social Media
Jisun An, Daniele Quercia, Meeyoung Cha, Krishna Gummadi, Jon Crowcroft
Proceedings of the 5th Annual ACM Web Science Conference (WebSci), 2013

Fragmented Social Media: A Look into Selective Exposure to Political News
Jisun An, Daniele Quercia, Jon Crowcroft (poster)
Proceedings of the 22nd International Conference on World Wide Web (WWW) Companion, 2013

More of a Receiver than a Giver: Why Do People Unfollow in Twitter?
Haewoon Kwak, Sue Moon, Wonjae Lee (4 page poster)
Proceedings of the 6th International AAAI Conference on Weblogs and Social Media (ICWSM), 2012

Visualizing Media Bias through Twitter
Jisun An, Meeyoung Cha, Krishna Gummadi, Jon Crowcroft, Daniele Quercia
ICWSM Workshop on the Potential of Social Media Tools and Data for Journalists, 2012

Consistent Community Identification in Complex Networks
Haewoon Kwak, Sue Moon, Young-Ho Eom, Yoonchan Choi, Hawoong Jeong
Journal of Korean Physical Society, Vol. 59, No. 5, November 2011.

Fragile Online Relationship: a First Look at Unfollow Dynamics in Twitter
Haewoon Kwak, Hyunwoo Chun, Sue Moon
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI), 2011.
Press coverage - Kyunghyang Shinmun
150+ papers citing this work (Google scholar)

Media Landscape in Twitter: A World of New Conventions and Political Diversity
Jisun An, Meeyoung Cha, Krishna Gummadi, Jon Crowcroft
Proceedings of the 5th International AAAI Conference on Weblogs and Social Media (ICWSM), 2011.
190+ papers citing this work (Google scholar)

What is Twitter, a Social Network or a News Media?
Haewoon Kwak, Changhyun Lee, Hosung Park, Sue Moon
Proceedings of the 19th international conference on World wide web (WWW), 2010.
Press coverage - Mashable Op-Ed, ReadWrite, The Guardian, PC News, Chosun Ilbo, DongA Ilbo
8000+ papers citing this work (Google scholar)

Finding Influentials based on the Temporal Order of Information Adoption in Twitter
Changhyun Lee, Haewoon Kwak, Hosung Park, Sue Moon (poster)
Proceedings of the 19th international conference on World wide web (WWW), 2010.
100+ papers citing this work (Google scholar)

Understanding Topological Mesoscale Features in Community Mining
Sue Moon, Jinyoung You, Haewoon Kwak, Daniel Kim, and Hawoong Jeong (invited paper)
Proceedings of the Second International Conference on COMmunication Systems and NETworks (COMSNETS), 2010.

Analyzing the Video Popularity Characteristics of Large-Scale User Generated Content Systems
Meeyoung Cha, Haewoon Kwak, Pablo Rodriguez, Yong-Yeol Ahn, and Sue Moon
ACM/IEEE Transactions on Networking, Vol 17, Issue 5, 2009
500+ papers citing this work (Google scholar)

Mining Communities in Networks: a Solution for Consistency and Its Evaluation
Haewoon Kwak, Yoonchan Choi, Young-Ho Eom, Hawoong Jeong, Sue Moon
Proceedings of the 9th ACM SIGCOMM conference on Internet measurement (IMC), 2009

The Wisdom of the Few: A Collaborative Filtering Approach based on Expert Opinions from the Web
Xavier Amatriaain, Neal Lathia, Josep M. Pujol, Haewoon Kwak, Nuria Oliver
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval (SIGIR), 2009
150+ papers citing this work (Google scholar)

Connecting Users with Similar Interests Across Multiple Web Services
Haewoon Kwak, Hwa-Yong Shin, Jong-Il Yoon, Sue Moon (poster)
Proceedings of the 3rd International AAAI Conference on Weblogs and Social Media (ICWSM), 2009

Comparison of Online Social Relations in Volume vs Interaction: A Case Study of Cyworld
Hyunwoo Chun, Haewoon Kwak, Young-Ho Eom, Yong-Yeol Ahn, Sue Moon, and Hawoong Jeong
Proceedings of the 8th ACM SIGCOMM conference on Internet measurement (IMC), 2008
250+ papers citing this work (Google scholar)

I Tube, You Tube, Everybody Tubes: Analyzing the World’s Largest User Generated Content Video System
Meeyoung Cha, Haewoon Kwak, Pablo Rodriguez, Yong-Yeol Ahn, Sue Moon
Proceedings of the 7th ACM SIGCOMM conference on Internet measurement (IMC), 2009
Best paper award
1,900+ papers citing this work (Google scholar)

Analysis of topological characteristics of huge online social networking services
Yong-Yeol Ahn, Seungyeop Han, Haewoon Kwak, Sue Moon, Hawoong Jeong
Proceedings of the 16th international conference on World Wide Web (WWW), 2007
1,200+ papers citing this work (Google scholar)