iug-htw/Sentiment-Analysis

Sentiment-Analysis

This repository contains the data that was used in support of the HCC 2019 paper Analyzing Sentiments of German Job References.

Authors: Finn Folkerts, Vanessa Schreck, Shirin Riazy and Katharina Simbeck

Published at: more information will follow

For more information on our research group, please go to https://iug.htw-berlin.de/.

Content of the Repository

German Job Reference Corpus: We collected 843 typical sentences from German books on how to write job reference letters and used them as templates. We combined these template sentences with subjects of varying gender, origin and nobility: each template was combined with each of 30 different surnames and both gender-specific titles, which yields 60 distinct sentences per template. Additionally, we replaced the title and surname in each template with the corresponding male or female pronoun, adding another two sentences per template. In total, the corpus consists of 52,266 sentences (843 templates × 62 variants), of which 1,686 are formed with a pronoun instead of a name.
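The expansion described above can be sketched in a few lines of Python. This is only an illustration of the counting, not the script used to build the corpus: the `{subject}` placeholder, the function name `expand_template`, the titles "Herr"/"Frau", the pronouns "Er"/"Sie", the example sentence and the generated surnames are all assumptions made for this sketch.

```python
from itertools import product

# Hypothetical placeholder token; the actual template format is an assumption.
PLACEHOLDER = "{subject}"


def expand_template(template, surnames, titles=("Herr", "Frau"), pronouns=("Er", "Sie")):
    """Return all name-based and pronoun-based variants of one template sentence."""
    # One sentence per (title, surname) pair: 2 titles x 30 surnames = 60.
    named = [
        template.replace(PLACEHOLDER, f"{title} {surname}")
        for title, surname in product(titles, surnames)
    ]
    # Two more sentences with a pronoun instead of title and surname.
    pronoun_based = [template.replace(PLACEHOLDER, pronoun) for pronoun in pronouns]
    return named + pronoun_based


# Illustrative stand-ins for the 30 surnames (not the surnames used in the corpus).
surnames = [f"Name{i}" for i in range(1, 31)]

variants = expand_template("{subject} arbeitete stets zuverlässig.", surnames)
print(len(variants))        # 62 variants per template
print(843 * len(variants))  # 52266 sentences in total
```

With 843 templates, the 60 name-based variants plus 2 pronoun variants per template give 843 × 62 = 52,266 sentences, of which 843 × 2 = 1,686 use a pronoun.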

Sentiment Analysis: We have tested the sentiment of all sentences in the corpus using 4 standard, commercially available sentiment analysis APIs:

  • Google: https://cloud.google.com/natural-language/
  • AWS: https://aws.amazon.com/comprehend
  • IBM: https://www.ibm.com/watson/services/natural-language-understanding/
  • Azure: https://azure.microsoft.com/en-en/services/cognitive-services/

The sentiments collected from each service in July 2019 are available in /results.

The scripts used to analyse those sentiment scores are available as Python scripts.

Project

This research was conducted as part of the project Diskriminiert durch Künstliche Intelligenz (Discriminated by Artificial Intelligence) at Hochschule für Technik und Wirtschaft (University of Applied Sciences) Berlin under the direction of Katharina Simbeck. The project was funded by the Hans-Böckler-Stiftung.

Hochschule für Technik und Wirtschaft

HTW Berlin, 10313 Berlin (Postfach)

Hans-Böckler-Stiftung

Hans-Böckler-Straße 39, 40476 Düsseldorf

Authors

  • Finn Folkerts - HTW Berlin - Email
  • Vanessa Schreck - HTW Berlin - Email
  • Shirin Riazy - HTW Berlin - Email
  • Katharina Simbeck - HTW Berlin - Email

License

Please refer to our LICENSE file for this information.

Citing

If you found this repository or our paper helpful, please consider citing us with this BibTeX entry.


@inproceedings{folkerts2019,
  author =       {Folkerts, Finn and Schreck, Vanessa and Riazy, Shirin and Simbeck, Katharina},
  title =        {Analyzing Sentiments of German Job References},
  crossref =     {hcc2019},
  pages =        {??--??},
  doi =          {???},
}

@proceedings{hcc2019,
  editor =       {???},
  title =        "???",
  booktitle =    "???(same as title)",
  publisher =    {???},
  venue =        {Laguna Hills, California, USA},
  month =        sep,
  year =         {2019},
  isbn =         {???},
}
