Text analysis (sometimes referred to as Natural Language Processing, or NLP) is the system for determining the theme and sentiment of a comment. For any open text questions (where the answer provided by the respondent is a written comment), we apply machine learning algorithms to provide the best estimate of what the theme of the comment is, and whether the sentiment is overly positive or negative. This is a very valuable part of using Qlearsite:
- it helps you summarise a large volume of comments in a quantitative way;
- it helps you find patterns in the text data that can be compared against the way respondents scored their other answers;
- it helps you identify potential root causes for particular issues.
What makes Qlearsite’s text analysis different from the competition?
We use sophisticated deep learning techniques to surface themes and sentiment, employing models that have been specifically trained on our ‘employee feedback’ domain. Many tools count words, our language analysis technology reads and understands them. We analyse the whole comment, to go deep into it’s meaning. Why? Because context matters and changes meaning.
For example, in this hypothetical sentence “This is not a black and white issue“
…basic, less accurate tools might think it is about ‘Diversity & Inclusion’. That’s because they scan for keywords – ‘black’ and ‘white’. Our technology understands the whole sentence and confidently classifies it as ‘Decision Making’.
On such deeply sensitive topics, being more accurate is essential. That’s why we’re trusted by the leading employers to analyse their valuable, sensitive employee feedback.
How does the Qlearsite Platform’s text analysis technology work?
Natural language processing is a subset of the broad field of artificial intelligence. It is applied to problems involving analytics, semantic search, text generation, optical character recognition, etc that allow computers to manipulate text. We specifically use it for topic modelling to find the topics (what we call ‘themes’) and meaning in the comments respondents write in their survey replies.
However we don’t share the details of our models as they are proprietary. In summary, our models are trained to find the most appropriate themes in a collection of sentences using the words, sentence structure and many other characteristics. Our models go beyond simple text classification, so called ‘bag of words’ models, often used for word frequencies and sentiment analysis.
Topic modelling assumes that each complex answer to an open question is a mixture of potential themes. We train the model to spot these hidden, or ‘latent’, themes to find the dominant topics in the text. This is important so that the model doesn’t rely on the actual key word to be present in order to spot the theme. The more data we have about a domain (such as employee engagement) the more it allows us to refine the model to be highly sensitive to recurring themes (such as pay, or diversity, etc).
How does the Qlearsite Platform ensure the text analysis is not biased?
It is inevitable to have bias in NLP text analysis because models are trained on a human language, that will potentially include gender bias and stereotypes. Machine Learning models will therefore learn these implicit biases and patterns. He have counteracted for bias by:
- Taking care in how we select our training data sets;
- Involving a broad range of staff from different backgrounds in annotating the training data set.
We are also looking at building a shorter feedback loop within the actual product, whereby our users can provide direct feedback to the model on specific issues and biases they see in their results, helping us speed up the rate of learning. Providing feedback is valuable to us and ask that you continue to send us questions and comments whenever they arise to email@example.com.