Nocode functions 🔎
... or read below about what it does, and how it works:
The technology includes a "precision" parameter to control finely if you need big ("macro") topics to be found, or instead if you prefer to identify many "micro" (smaller) topics.
The function identifies pairs of terms in each line of the text. These pairs are called cooccurrences. Aggregating all pairs of terms, a network of terms is constructed. The network is cut into subregions, and each subregion corresponds to a topic.
The principles followed by the tool are described in this academic publication studying how to find communities and topics on Twitter. The technology follows these steps:
Structure and format of the text: Topic extraction works by detecting pairs of terms which appear on the same line of text. So you should be careful about how your text is formatted. Ideally, it should be made of relatively short paragraphs, each on one line. If you are using an Excel file, each paragraph or significant block of text should appear on a different row.Volume of text: topics are found by measuring frequencies: which pair of terms tend to co-occur most often? For this to work, the text should be sufficiently long so that these counts are meaningful. The longer the text, the better. Texts of at least 5,000 words seem a good start.
This approach can make sense when we know in advance how many topics there are in the text. But what is the point of topic detection if we know the topics already? 🤔
In nocode functions, the number of topics to be found is not predetermined. The analyst wil learn a lot by discovering how many topics the algorithm can find in the text, without a preset limit. The analyst remains fully in control thanks to the precision parameter, which helps tune the algorithm to find more or less topics - but always with a degree of freedom on the exact number.
50 is the default. Choose a lower value to find more and smaller topics, and choose a larger value to find less and bigger topics: