The governments of #Australia, #Canada, #Cyprus, #Denmark, #Israel, and #Singapore are likely customers of Israeli #spyware maker #Paragon Solutions

The governments of #Australia, #Canada, #Cyprus, #Denmark, #Israel, and #Singapore are likely customers of Israeli #spyware maker #Paragon Solutions
LLM-Deliberation: Evaluating LLMs with Interactive Multi-Agent Negotiation Games
"Proposes using text-based negotiation games as benchmarks for evaluating LLMs’ reasoning and decision making capabilities, by using Chain-of-Thought prompts (CoT)." [gal30b+] #CL #CY #LG
https://github.com/S-Abdelnabi/LLM-Deliberation
https://arxiv.org/abs/2309.17234v1 #arxiv
How We Define Harm Impacts Data Annotations: Explaining How Annotators Distinguish Hateful, Offensive, and Toxic Comments
"Uses Venn Diagrams, information gain comparison, and content analysis to study the differences between annotator's use of three concepts - hateful, offensive, and toxic and to identify the factors that explain these differences." [gal30b+] #CL #CY
The Cambridge Law Corpus: A Corpus for Legal AI Research
"The CLC corpus consists of court cases from the UK, mostly from the last decade, as well as earlier cases going back to the 16th century." [gal30b+] #CL #CY
https://github.com/cambridge-ai-and-law-project/CambridgeLawCorpus"
https://arxiv.org/abs/2309.12269v1 #arxiv
Bad Actor, Good Advisor: Exploring the Role of Large Language Models in Fake News Detection
"An adaptive rationale guidance network (ARG) for fake news detection is proposed that utilizes the knowledge of the large language model to guide the small language model in fake news detection." [gal30b+] #CL #AI #CY
https://github.com/ICTMCG/ARG
https://arxiv.org/abs/2309.12247v1 #arxiv
Focal Inferential Infusion Coupled with Tractable Density Discrimination for Implicit Hate Speech Detection
"Focused Inferential Adaptive Density Discrimination, a novel framework that enhances implicit hate speech detection by bringing the surface form of an implicit hate speech closer to its implied form while increasing the inter-cluster distance among various class labels." [gal30b+] #CL #CY
https://github.com/LCS2-IIITD/FIADD
https://arxiv.org/abs/2309.11896v1 #arxiv
Casteist but Not Racist? Quantifying Disparities in Large Language Model Bias Between India and the West
"The LLMs tested are found to exhibit strong stereotypical and anti-stereotypical biases in the Indian caste and religion contexts, especially as compared to the Western context." [gal30b+] #CL #CY
https://github.com/khyatikhandelwal/Indian-LLMs-Bias
https://arxiv.org/abs/2309.08573v1 #arxiv
Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics
"Visual instruction tuning, a prevailing strategy for transitioning LLMs into MLLMs, unexpectedly helps models attain both improved truthfulness and ethical alignment in the pure NLP context." [gal30b+] #CL #AI #CV #CY #LG
github.com/UCSC-VLAA/Sight-Beyond-Text
https://arxiv.org/abs/2309.07120v1 #arxiv
The Challenges of Machine Learning for Trust and Safety: A Case Study on Misinformation Detection
"Analyzes 270 well-cited papers and identify shortcomings in the literature that call into question claimed performance and practicality of automated misinformation detection models." [gal30b+] #LG #CL #CY
https://github.com/citp/sok_misinformation
https://arxiv.org/abs/2308.12215v1 #arxiv
LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models
"LegalBench consists of 162 tasks covering 6 different types of legal reasoning, designed and hand-crafted by legal professionals, and evaluated against 20 open-source and commercial language models." [gal30b+] #CL #AI #CY
https://github.com/HazyResearch/legalbench/
https://arxiv.org/abs/2308.11462v1 #arxiv
Instructed to Bias: Instruction-Tuned Language Models Exhibit Emergent Cognitive Bias
"We examine three cognitive biases - the decoy effect, the certainty effect, and the belief bias - which are known to influence human decision-making and reasoning." [gal30b+] #CL #AI #CY #LG
https://github.com/google-research/
https://arxiv.org/abs/2308.00225v1 #arxiv