tidytext

Text mining using tidy tools

This is an exact mirror of the tidytext project, hosted at https://github.com/juliasilge/tidytext. SourceForge is not affiliated with tidytext.

Last Update: 2025-07-30

Linux Mac Windows

tidytext brings tidy data principles to text mining by converting text into a tidy data frame with one token per row. It provides tools for tokenization, sentiment analysis, n-gram creation, and document-term matrices, so text data can move through dplyr, ggplot2, and other tidyverse workflows without conversion steps.
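
As a minimal sketch of the one-token-per-row idea, the example below tokenizes two short documents with unnest_tokens and counts words with dplyr. The sample sentences and the names docs, doc_id, tokens, and word_counts are made up for illustration and are not part of the package.

```r
library(dplyr)
library(tidytext)

# Two toy documents (illustrative data, not from the tidytext project)
docs <- tibble(
  doc_id = c("a", "b"),
  text = c("Tidy data makes text mining easier.",
           "Text mining with tidy tools.")
)

# One row per word; by default tokens are lowercased and punctuation is dropped
tokens <- docs %>%
  unnest_tokens(output = word, input = text)

# Ordinary dplyr verbs work on the result
word_counts <- tokens %>%
  count(word, sort = TRUE)

print(word_counts)
```

Because the result is a regular data frame, the same tokens object can be passed to ggplot2 or joined against other tables.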

Features

Tokenizes text into a one-token-per-row tidy format (unnest_tokens)
Supports sentiment lexicons such as Bing and NRC (get_sentiments) and TF-IDF computation (bind_tf_idf)
Converts tm or quanteda objects into tidy data frames (tidy)
Integrates with dplyr and ggplot2 for analysis and visualization
Functions for n-grams and for casting tidy data to document-term matrices (cast_dtm, cast_dfm, cast_sparse); word co-occurrence analysis is usually done with the companion widyr package
Compatible with existing tidy data pipelines in R
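
The features above can be sketched in one short pipeline. This is an illustrative example under stated assumptions, not the project's own sample code: the two sentences and all object names are invented, the Bing lexicon is used because it ships with tidytext, and cast_sparse is used instead of cast_dtm so that the tm package is not required.

```r
library(dplyr)
library(tidytext)

# Toy corpus (illustrative data)
docs <- tibble(
  doc_id = c("a", "b"),
  text = c("A good day with great friends.",
           "A bad day with terrible weather.")
)

# Word tokens, one per row
words <- docs %>%
  unnest_tokens(word, text)

# Bigrams: the same function with a different tokenizer
bigrams <- docs %>%
  unnest_tokens(bigram, text, token = "ngrams", n = 2)

# Sentiment: join tokens against the Bing lexicon and count per document
doc_sentiment <- words %>%
  inner_join(get_sentiments("bing"), by = "word") %>%
  count(doc_id, sentiment)

# TF-IDF: words that occur in every document get a score of zero
tf_idf <- words %>%
  count(doc_id, word) %>%
  bind_tf_idf(term = word, document = doc_id, n = n)

# Cast the tidy counts to a sparse document-term matrix
dtm <- words %>%
  count(doc_id, word) %>%
  cast_sparse(doc_id, word, n)

print(doc_sentiment)
print(arrange(tf_idf, desc(tf_idf)))
```

Each step takes and returns a data frame until the final cast, which is the point at which the data leaves the tidy format for tools that expect a matrix.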

Additional Project Details

Operating Systems: Linux, Mac, Windows
Programming Language: R
Related Categories: R Natural Language Processing (NLP) Tool
Registered: 2025-07-30
