Package: textmineR 3.0.5.999

textmineR: Functions for Text Mining and Topic Modeling

An aid for text mining in R, with a syntax that should be familiar to experienced R users. Provides a wrapper for several topic models that take similarly formatted input and give similarly formatted output. Has additional functionality for analyzing and diagnosing topic models.

Authors: Tommy Jones [aut, cre], William Doane [ctb], Mattias Attbom [ctb]

# Install 'textmineR' in R:

install.packages('textmineR', repos = c('https://tommyjones.r-universe.dev', 'https://cloud.r-project.org'))

Bug tracker: https://github.com/tommyjones/textminer/issues

Pkgdown site: https://www.rtextminer.com

Uses libs:

c++ - GNU Standard C++ Library v3

Datasets:

nih_sample - Abstracts and metadata from NIH research grants awarded in 2014
nih_sample_dtm - Abstracts and metadata from NIH research grants awarded in 2014
nih_sample_topic_model - Abstracts and metadata from NIH research grants awarded in 2014

On CRAN:

cpp

10.83 score, 106 stars, 7 packages, 310 scripts, 3.2k downloads, 5 mentions, 33 exports, 28 dependencies

Last updated 2 years ago from: 03b109d6e0. Checks: 1 OK, 11 NOTE. Indexed: yes.

Target	Result	Latest binary
Doc / Vignettes	OK	Mar 05 2025
R-4.5-win-x86_64	NOTE	Mar 05 2025
R-4.5-mac-x86_64	NOTE	Mar 05 2025
R-4.5-mac-aarch64	NOTE	Mar 05 2025
R-4.5-linux-x86_64	NOTE	Mar 05 2025
R-4.4-win-x86_64	NOTE	Mar 05 2025
R-4.4-mac-x86_64	NOTE	Mar 05 2025
R-4.4-mac-aarch64	NOTE	Mar 05 2025
R-4.4-linux-x86_64	NOTE	Mar 05 2025
R-4.3-win-x86_64	NOTE	Mar 05 2025
R-4.3-mac-x86_64	NOTE	Mar 05 2025
R-4.3-mac-aarch64	NOTE	Mar 05 2025

Exports: CalcGamma CalcHellingerDist CalcJSDivergence CalcLikelihood CalcLikelihoodC CalcProbCoherence CalcSumSquares CalcTopicModelR2 Cluster2TopicModel CreateDtm CreateTcm dtm_to_lexicon_c Dtm2Docs Dtm2DocsC Dtm2Lexicon Dtm2Tcm fit_lda_c FitCtmModel FitLdaModel FitLsaModel GetProbableTerms GetTopTerms Hellinger_cpp HellingerMat JSD_cpp JSDmat LabelTopics posterior predict_lda_c SummarizeTopics TermDocFreq TmParallelApply update

Dependencies: cli data.table digest float glue gtools ISOcodes lattice lgr lifecycle magrittr Matrix MatrixExtra mlapi R6 Rcpp RcppArmadillo RcppEigen RcppProgress RhpcBLASctl rlang rsparse RSpectra stopwords stringi stringr text2vec vctrs

Start here

Thomas W. Jones

Rendered from a_start_here.Rmd using knitr::rmarkdown on Mar 05 2025.

Last update: 2021-06-27
Started: 2018-02-10

Document clustering

Thomas W. Jones

Rendered from b_document_clustering.Rmd using knitr::rmarkdown on Mar 05 2025.

Last update: 2021-06-27
Started: 2018-02-10

Topic modeling

Thomas W. Jones

Rendered from c_topic_modeling.Rmd using knitr::rmarkdown on Mar 05 2025.

Last update: 2022-05-11
Started: 2018-02-10

Text embeddings

Thomas W. Jones

Rendered from d_text_embeddings.Rmd using knitr::rmarkdown on Mar 05 2025.

Last update: 2021-06-27
Started: 2018-02-10

Document summarization

Thomas W. Jones

Rendered from e_doc_summarization.Rmd using knitr::rmarkdown on Mar 05 2025.

Last update: 2019-01-04
Started: 2018-02-10

Using tidytext with textmineR

Thomas W. Jones

Rendered from f_tidytext_example.Rmd using knitr::rmarkdown on Mar 05 2025.

Last update: 2019-01-09
Started: 2018-12-23

Readme and manuals

Help Manual

Help page	Topics
Calculate a matrix whose rows represent P(topic_i|tokens)	CalcGamma
Calculate Hellinger Distance	CalcHellingerDist
Calculate Jensen-Shannon Divergence	CalcJSDivergence
Calculate the log likelihood of a document term matrix given a topic model	CalcLikelihood
Probabilistic coherence of topics	CalcProbCoherence
Calculate the R-squared of a topic model.	CalcTopicModelR2
Represent a document clustering as a topic model	Cluster2TopicModel
Convert a character vector to a document term matrix.	CreateDtm
Convert a character vector to a term co-occurrence matrix.	CreateTcm
Convert a DTM to a Character Vector of documents	Dtm2Docs
Turn a document term matrix into a list for LDA Gibbs sampling	Dtm2Lexicon
Turn a document term matrix into a term co-occurrence matrix	Dtm2Tcm
Fit a Correlated Topic Model	FitCtmModel
Fit a Latent Dirichlet Allocation topic model	FitLdaModel
Fit a topic model using Latent Semantic Analysis	FitLsaModel
Get cluster labels using a "more probable" method of terms	GetProbableTerms
Get Top Terms for each topic from a topic model	GetTopTerms
Internal helper functions for 'textmineR'	CalcLikelihoodC CalcSumSquares Dtm2DocsC dtm_to_lexicon_c fit_lda_c HellingerMat Hellinger_cpp JSDmat JSD_cpp predict_lda_c
Get some topic labels using a "more probable" method of terms	LabelTopics
Abstracts and metadata from NIH research grants awarded in 2014	nih nih_sample nih_sample_dtm nih_sample_topic_model
Posterior methods for topic models	posterior
Draw from the posterior of an LDA topic model	posterior.lda_topic_model
Predict method for Correlated topic models (CTM)	predict.ctm_topic_model
Get predictions from a Latent Dirichlet Allocation model	predict.lda_topic_model
Predict method for LSA topic models	predict.lsa_topic_model
Summarize topics in a topic model	SummarizeTopics
Get term frequencies and document frequencies from a document term matrix.	TermDocFreq
textmineR	textmineR
An OS-independent parallel version of 'lapply'	TmParallelApply
Update methods for topic models	update
Update a Latent Dirichlet Allocation topic model with new data	update.lda_topic_model
