Package: inpdfr 0.1.12

inpdfr: Analyse Text Documents Using Ecological Tools

A set of functions to analyse and compare texts, using classical text mining functions, as well as those from theoretical ecology.

Authors:Rebaudo Francois

inpdfr_0.1.12.tar.gz
inpdfr_0.1.12.zip(r-4.5)inpdfr_0.1.12.zip(r-4.4)inpdfr_0.1.12.zip(r-4.3)
inpdfr_0.1.12.tgz(r-4.4-any)inpdfr_0.1.12.tgz(r-4.3-any)
inpdfr_0.1.12.tar.gz(r-4.5-noble)inpdfr_0.1.12.tar.gz(r-4.4-noble)
inpdfr_0.1.12.tgz(r-4.4-emscripten)inpdfr_0.1.12.tgz(r-4.3-emscripten)
inpdfr.pdf |inpdfr.html
inpdfr/json (API)
NEWS

# Install 'inpdfr' in R:
install.packages('inpdfr', repos = c('https://frareb.r-universe.dev', 'https://cloud.r-project.org'))

Peer review:

Bug tracker:https://github.com/frareb/inpdfr/issues

Datasets:

On CRAN:

24 exports 2 stars 0.84 score 89 dependencies 26 scripts 269 downloads

Last updated 1 years agofrom:845a901af9. Checks:OK: 7. Indexed: yes.

TargetResultDate
Doc / VignettesOKSep 17 2024
R-4.5-winOKSep 17 2024
R-4.5-linuxOKSep 17 2024
R-4.4-winOKSep 17 2024
R-4.4-macOKSep 17 2024
R-4.3-winOKSep 17 2024
R-4.3-macOKSep 17 2024

Exports:doClusterdoKmeansClustdoMetacomEntropartdoMetacomMetacomexcludeStopWordsgetAllAnalysisgetListFilesgetMostFreqWordgetMostFreqWordCorgetPDFgetStopWordsgetSummaryStatsBARPLOTgetSummaryStatsHISTOgetSummaryStatsOCCURgetTXTgetwordOccuDFgetXFreqWordIdentifyStructuremakeWordcloudmergeWordFreqpostProcTxtpreProcTxtquitSpaceFromCharstruncNumWords

Dependencies:abindapebackportsbase64encBHbootbroomcarcarDatacliclustercolorspacecorrplotcowplotcpp11DerivdigestdoBydplyrentropartEntropyEstimationfansifarvergenericsggplot2ggpubrggrepelggsciggsignifgluegridExtragtableisobandlabelinglatticelifecyclelme4magrittrMASSMatrixMatrixModelsmetacommgcvmicrobenchmarkminqamodelrmunsellnlmenloptrNLPnnetnumDerivpbkrtestpermutepillarpkgconfigplyrpolynompurrrquantregR.devicesR.methodsS3R.ooR.utilsR6RColorBrewerRcppRcppEigenreshape2rlangrstatixscalesslamSnowballCSparseMstringistringrsurvivaltibbletidyrtidyselecttmutf8vctrsveganviridisLitewithrwordcloudxml2

Introduction to inpdfr package

Rendered frominpdfr-vignette.Rmdusingknitr::rmarkdownon Sep 17 2024.

Last update: 2023-08-24
Started: 2016-03-11

Readme and manuals

Help Manual

Help pageTopics
Performs a cluster analysis on the basis of the word-occurrence data.frame.doCluster
Performs a k-means cluster analysis on the basis of the word-occurrence data.frame.doKmeansClust
Performs an analysis of ecological diversity and structure.doMetacomEntropart
Performs a metacomunity analysis.doMetacomMetacom
Exclude StopWords form the word-occurrence data.frame.excludeStopWords
Stop words in French.exclusionList_FR
Stop words in Spanish.exclusionList_SP
Stop words in English.exclusionList_UK
A quick way to compute a set of analysis from the word-occurrence data.frame.getAllAnalysis
List files in a specified directory sorted by extension.getListFiles
Returns most frequent words.getMostFreqWord
Test for correlation between the most frequent words.getMostFreqWordCor
Extract text from PDF files and return a word-occurrence data.frame.getPDF
Load a list of stopwords.getStopWords
Perform a barplot with the number of unique words per documentgetSummaryStatsBARPLOT
Plot an histogram with the number of words excluding stop wordsgetSummaryStatsHISTO
Plot a scatter plot with the proportion of documents using similar words.getSummaryStatsOCCUR
Extract text from TXT files and return a word-occurrence data.frame.getTXT
A quick way to obtain the word-occurrence data.frame from a set of documents.getwordOccuDF
Returns most frequent wordsgetXFreqWord
Copy of the identifyStructure function from Tad Dallas metacom package.IdentifyStructure
inpdfr: A package to analyse PDF Files Using Ecological Tools.inpdfr-package inpdfr
Lorem Ipsum text.loremIpsum
Word cloud based on the word-occurrence data.frame.makeWordcloud
Merge word-occurrence data.frames into a single data.frame.mergeWordFreq
Prossess vectors containing words into a data.frame of word occurrences.postProcTxt
Extract text from txt files and pre-process content.preProcTxt
Delete spaces in file names.quitSpaceFromChars
Truncate the word-occurrence data.frame.truncNumWords
Lorem Ipsum word occurrences.wordOccuDF