Technologies: Rhetoric Analytics

In The New Ethos, we develop Rhetoric Analytics, i.e., a sense-making technology that provides insights into communication strategies used in large-scale discourse. It is founded upon data analytics, i.e., a wide-spread (cf. Business Intelligence, Media Analysis), science-enabled range of tools, techniques and processes used to convert raw data into insights, to analyse such data in order to make conclusions about information, to interpret them in a meaningful way and to extract knowledge. As a result, we are able to reveal statistical patterns, trends and tendencies of the variety of devices and strategies in rhetoric of the digital society. The modular architecture of this technology allows us to develop each module of analytics independently and to add more and more modules that increase robustness in analysing variety of rhetorical and linguistic uses of language in the digital communication. Below, you will find the description of the currently available modules of Rhetoric Analytics. At the very bottom, you will find the descriptions of corpora on which Rhetoric Analytics run.

LEPAn

Logos-Ethos-Pathos Analytics is the foundational tool in the Rhetoric Analytics technology, as it allows us to inspect the use of key strategies of logos, ethos and pathos in the rhetoric of the digital society. It enables the analyst to understand not only a single rhetorical device, but also to inspect similarities and differences between these three types of rhetorical behaviour.

LEPAn-v01. This is the initial version of the tool that allows the analysis of logos, ethos and pathos, as well as comparison between them in online discourse: (1) Reddit discussions on COVID-19 vaccination, PolarIs1; and (2) Reddit reactions to the US 2016 presidential debates, PolarIs5.

Available at: https://lepan-v01.streamlit.app/, developed by Ewelina Gajewska
User manual can be downloaded here: LEPAn-v01 user manual
Please read and cite these papers, if you are using LEPAn-v01 tool:
— K. Budzynska, M. Koszowy, E. Gajewska, M. Kulik, M. Uberna (2024) A Computational Method for Quantitative Analysis of Ethos. In: A Hess and J E Kjeldsen (Eds) Ethos, Technology, and AI in Contemporary Society: The Character in the Machine, Routledge.
— E. Gajewska, K. Budzynska, B. Konat (2024) Analytics for Rhetorical Arguments in Discussions on Polarised Issues. In: Proceedings of the 10th International Conference on Computational Models of Argument (COMMA2024), IOS Press.

DynRephAn

Dynamics of Rephrase Analytics is the second foundational tool in Rhetoric Analytics, as it allows us to analyse the transformations of the use of rhetorical devices as a result of rephrasing information, i.e. to analyse them as they change when speakers rephrase what they say. The special role of DynRephAn consists in treating an argument relation of rephrase as a process of how a debate is evolving, how rhetorical devices are changed and manipulated by speakers. This means that we are able to inspect not only results of rhetorical or linguistic use of language, e.g., by comparing the frequencies of using logos vs ethos, but we are also able to trace how speakers were strategically influencing the character of the discussion, e.g., by shifting from using pure logos to using logos loaded with ethos.

DynRephAn-v01. This is the initial version of the tool that allows the analysis of ethos and sentiment in rephrase in social media, traditional media and face-to-face discussions in the following corpora: (1) Reddit discussions on COVID-19 vaccination, PolarIs1; (2) Reddit reactions to the US 2016 presidential debates, PolarIs5; (3) televised presidential debates in the US from 2016, USElections; and (4) UK parliamentary debates from the period of Margaret Thatcher as a prime minister, Hansard. It allows qualitative analysis of merged corpora or comparative study of each single corpus with dynamics of ethos or sentiment counts presented in piecharts, barcharts, wordclouds and tables.

Available at: https://dynrephan-v01.streamlit.app/, developed by Maciej Uberna
User manual can be downloaded here: APPEARS SOON
Please read and cite these papers, if you are using DynRephAn-v01 tool:
— K. Budzynska, M. Koszowy, E. Gajewska, M. Kulik, M. Uberna (2024) A Computational Method for Quantitative Analysis of Ethos. In: A Hess and J E Kjeldsen (Eds) Ethos, Technology, and AI in Contemporary Society: The Character in the Machine, Routledge.
— M. Uberna, K. Budzynska, M. Koszowy (2024) Exploring The Dynamics of Reformulation in Natural Dialogue. PPRAI 2024, DOI:10.17388/WUT.2024.0002.MiNI page 541-548.

DynRephAn-v02. This is the extended version that allows more complex analysis of single corpus and comparative examinations of selected corpora. New options allow part of speech analysis, ranking of most frequent n-gram (up to 4-grams), parts of speech ranking, lexemes ranking and synonym group ranking as well as parts of speech combinations rankings.

Available at: https://dynrephrean-v02.streamlit.app/, developed by Maciej Uberna
User manual can be downloaded here: APPEARS SOON
Please read and cite these papers, if you are using DynRephAn-v01 tool:
— M. Uberna, K. Budzynska, M. Koszowy, P. Saint-Dizier (2024) Analytics for Linguistically Characterising Rephrased Arguments, Proceedings of the 10th International Conference on Computational Models of Argument (COMMA2024), IOS Press.

MArgAn

Moral Argument Analytics is a tool in Rhetoric Analytics that allows us to investigate the use of moral values in argumentation. Morals serve as a core of what we believe and how we see the world. This is why a tool, which enables the identification and understanding of whether people appeal to dignity or loyalty, benevolence or justice, authority or equality, provides a deep insight into what truly drives a discussion and argumentation.

MArgAn-v01. This is the initial version of the tool that allows the automatic detection of moral foundations (defined according to the Moral Foundations Theory, Haidt and Joseph 2004) in large-scale datasets of dialogical arguments, and the statistical summaries and visual representations of moral foundations patterns in argumentation.

Available at: https://margan-v01.streamlit.app/, developed by He Zhang
User manual can be downloaded here: MArgAn-v01 user manual
Please read and cite these papers, if you are using MArgAn-v01 tool:
— A. Landowska, K. Budzynska, H. Zhang (2024) Quantitative and Qualitative Analysis of Moral Foundations in Argumentation, Argumentation 38, 405–434, DOI 10.1007/s10503-024-09636-x
— H. Zhang, A. Landowska, K. Budzynska (2024) Detection and Analysis of Moral Values in Argumentation. In: N. Osman, L. Steels (Eds.) Value Engineering in Artificial Intelligence, Lecture Notes in Computer Science, LNCS (Post-Proceedings of A workshop affiliated with the 26th European Conference on Artificial Intelligence: Value engineering in AI), volume 14520, p 114-141.

OLAn

OLAn is our tool that allows us to analyse the use of offensive language and to identify patterns and trends in its use. This rhetorical strategy is important, as it overlaps with hate speech and polarisation.

OLAn-v01. The initial version of OLAn is a social media analysis technology developed to understand the use of offensive language in social media discourse, i.e., in Twitter discussions during the 2021 United Nations Climate Change Conference. More specifically, it allows us to analyse and visualise patterns in offensive reactions to public figures across their social roles and popularity in polarised discourse online.

Available at: https://olan-v01.streamlit.app/, developed by He Zhang
User manual can be downloaded here: OLAn-v01 user manual
Please read and cite this paper, if you are using OLAn-v01 tool:
— M. Kulik, K. Budzynska, H. Zhang, M-A. Paquin, B. Konat (2025) Offensive Language in Reactions to Public Figures in Polarised Discourse Online, Journal of Language Aggression and Conflict, John Benjamins Publishing Company. DOI 10.1075/jlac.00136.kul

Other tools

More information will be published soon on other tools developed in the Laboratory of The New Ethos. Papers that introduce them are currently in review, so we are not able to reveal their names and descriptions now.

Corpora

ETHOS SUPPORTS AND ATTACKS. The Thatcher’s Ethos in Hansard corpus was constructed by taking a random subsample of Hansard according to the following rubric: select the first two House of Commons debates over 700 words in length from the day closest to the date(s) at the midpoint(s) of the largest uninterrupted date range(s) (initially the midpoint in the range 4th May 1979 and 22nd November 1990 – viz., 11th February 1985; then at the midpoints between 4th May 1979 and 11th February 1985, and between 11th February 1985 and 22nd November 1990, etc.). This avoids bias for annotators and yielded 60 transcripts, the data in each of which was then cleaned such that any titles and section markers were removed to leave only the speakers, organisations or other entities and the statements they made. The transcripts were then split evenly to give a training set and a testing set. The training set formed the training data for the sentiment polarity classifier and was used as the basis for developing domain specific rules for recognising ethotic sentiment expressions.

Ethos support. Ethos support should be identified when: (a) the statement makes explicit mentions of a person, organisation or other entity (excluding groups and assemblages) except when this is reported speech; and (b) it takes the form of supporting a person’s credibility or looking to put them in a positive frame through character supports or supports of work; and (c) a support to a person’s own ethos should not be analysed as this is deemed to be a fallacy. Ethos attack. Ethos attack should be identified when: (a) the statement makes explicit mentions of a person, organisation or other entity (excluding groups and assemblages) except when this is reported speech; and (b) it takes the form of attacking a person’s credibility or looking to put them into a negative frame; or (c) it may take the form of trying to unbalance authority on a subject giving the attacker more of a right to talk about the subject.

Corpus Ethos-Hansard1 available at: https://corpora.aifdb.org/Hethos

Annotation scheme can be downloaded here: Ethos_AnnSch_v01
Please cite this paper, if you are using Ethos-Hansard1 corpus:
R. Duthie, K. Budzynska, C. Reed (2016) Mining Ethos in Political Debate, Frontiers in Artificial Intelligence and Applications. Proc. of 6th International Conference on Computational Models of Argument (COMMA 2016), Pietro Baroni, Thomas F. Gordon, Tatjana Scheffler, Manfred Stede (Eds.), vol. 287, IOS Press, pp. 299-310.

Corpus Ethos-Hansard2 available at: https://corpora.aifdb.org/EthosHansard2

Annotation scheme can be downloaded here: Ethos_AnnSch_v02
Please cite this paper, if you are using Ethos-Hansard2 corpus:
R. Duthie, K. Budzynska (2018) A Deep Modular RNN Approach for Ethos Mining, Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, IJCAI-18, 4041-4047.

ETHOS TYPES: WISDOM, VIRTUE, GOODWILL. A 12% of the data was annotated by two annotators for the purpose of evaluation. Overall this gave Cohen’s kappa=0.42, a percentage agreement of 57\% and weighted $\kappa=0.63$. The kappa score is considered fair but the weighted kappa score shows room for improvement in the guidelines. To this extent a third annotator annotated a smaller 8% subset of the data to provide a pairwise agreement score. Comparing all three annotators gave Fleiss kappa = 0.51. A pairwise comparison of annotators one and three gave kappa = 0.63 and weighted kappa = 0.79. Comparing annotators two and three gave kappa = 0.51 and weighted kappa = 0.70.

Building upon the Aristotelian distinction between ethos types we proposed the annotation of ethos types using three main tags (Practical Wisdom, Moral Virtue and Goodwill) split into support and attack (Argument and Conflict) according to the following guidelines: Practical Wisdom: Argument From Practical Wisdom should be annotated when: (a) an entity is said to have sufficient knowledge for the purpose at hand; or (b) an entity can draw conclusions from this knowledge; or (c) an entity has practical experience; (d) an entity can draw conclusions from this experience. While Conflict From Practical Wisdom should be annotated when the opposite is true. Moral Virtue: Argument From Moral Virtue should be annotated when: (a) a statement refers to the character trait of an entity, when the entity shows positive morality, calmness, justness, selflessness, gracefulness, nobility, positive contributions, liberality, magnanimity or magnificence; or (b) when an entity provides the correct information. While Conflict From Moral Virtue should be annotated when the opposite is true. Goodwill: Argument From Goodwill should be annotated when: (a) a statement refers to an entity’s ability to show goodwill to others; or (b) an entity gives sound advice when it is know, ensuring the entity does not deceive while being inclusive; or (c) an entity aligns with an audience’s values, displaying self sacrifice. While Conflict From Goodwill should be annotated when the opposite is true.

EthosHansard2_WVG1 available at: https://corpora.aifdb.org/EthosHansard2WVG1
Annotation scheme can be downloaded here: Ethos-WVG_AnnSch_v01
Please cite this paper, if you are using WVG1 corpus:
R. Duthie, K. Budzynska (2018) Classifying Types of Ethos Supports and Attacks, In: Modgil S., Budzynska K. and Lawrence J. (Eds.) Proc. of 7th International Conference on Computational Models of Argument (COMMA 2018), Frontiers in Artificial Intelligence and Applications, Volume 305, pp. 161 – 168.
EthosHansard2_WVG2 available at: https://corpora.aifdb.org/EthosHansard2WVG2
EthosHansard2_WVG3 available at: https://corpora.aifdb.org/EthosHansard2WVG3
Annotation scheme can be downloaded here: Ethos-WVG_AnnSch_v02 and Ethos-WVG_AnnSch_v03
Please cite this paper, if you are using WVG corpus:
M. Koszowy, K. Budzynska, M. Pereira-Fariña, R. Duthie (2022) From Theory of Rhetoric to the Practice of Language Use: The Case of Appeals to Ethos Elements, Argumentation, 36, p. 123–149.

LOGOS ACCORDING TO INFERENCE ANCHORING THEORY. In The New Ethos, logos is annotated according to Inference Anchoring Theory, IAT (Budzynska & Reed, 2010). This framework allows for the representation of argument, dialogical and illocutionary structures as well as interactions between the three of them. The first version of an IAT annotation scheme was developed by ARG-tech. We use two corpora that were created by this team: US2016tv and US2016reddit. Additionally, our team annotated two further corpora described below.

Corpus IAT-Hansard is available at: https://corpora.aifdb.org/IATHansard2
Corpus IAT-Covid is available at: https://corpora.aifdb.org/PolarIs1
Annotation scheme, Logos_AnnSch_v01, developed in ARG-tech can be downloaded here: https://arg.tech/~jacky/US2016-guidelines.pdf