Package: JATSdecoder 1.2.0
JATSdecoder: A Metadata and Text Extraction and Manipulation Tool Set
Provides a function collection to extract metadata, sectioned text and study characteristics from scientific articles in 'NISO-JATS' format. Articles in PDF format can be converted to 'NISO-JATS' with the 'Content ExtRactor and MINEr' ('CERMINE', <https://github.com/CeON/CERMINE>). For convenience, two functions bundle the extraction heuristics: JATSdecoder() converts 'NISO-JATS'-tagged XML files to a structured list with elements title, author, journal, history, 'DOI', abstract, sectioned text and reference list. study.character() extracts multiple study characteristics like number of included studies, statistical methods used, alpha error, power, statistical results, correction method for multiple testing, software used. An estimation of the involved sample size is performed based on reports within the abstract and the reported degrees of freedom within statistical results. In addition, the package contains some useful functions to process text (text2sentences(), text2num(), ngram(), strsplit2(), grep2()). See Böschen, I. (2021) <doi:10.1007/s11192-021-04162-z> Böschen, I. (2021) <doi:10.1038/s41598-021-98782-3> and Böschen, I (2023) <doi:10.1038/s41598-022-27085-y>.
Authors:
JATSdecoder_1.2.0.tar.gz
JATSdecoder_1.2.0.zip(r-4.5)JATSdecoder_1.2.0.zip(r-4.4)JATSdecoder_1.2.0.zip(r-4.3)
JATSdecoder_1.2.0.tgz(r-4.4-any)JATSdecoder_1.2.0.tgz(r-4.3-any)
JATSdecoder_1.2.0.tar.gz(r-4.5-noble)JATSdecoder_1.2.0.tar.gz(r-4.4-noble)
JATSdecoder_1.2.0.tgz(r-4.4-emscripten)JATSdecoder_1.2.0.tgz(r-4.3-emscripten)
JATSdecoder.pdf |JATSdecoder.html✨
JATSdecoder/json (API)
# Install 'JATSdecoder' in R: |
install.packages('JATSdecoder', repos = c('https://ingmarboeschen.r-universe.dev', 'https://cloud.r-project.org')) |
Bug tracker:https://github.com/ingmarboeschen/jatsdecoder/issues
cermineniso-jatspubmedcentraltext-extractiontext-miningxml-files
Last updated 1 days agofrom:0e6c2d0b5c. Checks:OK: 1 WARNING: 6. Indexed: yes.
Target | Result | Date |
---|---|---|
Doc / Vignettes | OK | Nov 22 2024 |
R-4.5-win | WARNING | Nov 22 2024 |
R-4.5-linux | WARNING | Nov 22 2024 |
R-4.4-win | WARNING | Nov 22 2024 |
R-4.4-mac | WARNING | Nov 22 2024 |
R-4.3-win | WARNING | Nov 22 2024 |
R-4.3-mac | WARNING | Nov 22 2024 |
Exports:allStatsest.ssget.abstractget.affget.alpha.errorget.assumptionsget.authorget.categoryget.contribget.countryget.doiget.editorget.historyget.journalget.keywordsget.methodget.multi.comparisonget.n.studiesget.outlier.defget.powerget.R.packageget.referencesget.sentence.with.patternget.sig.adjectivesget.softwareget.statsget.subjectget.tablesget.test.directionget.textget.titleget.typeget.volgrep2has.interactionJATSdecoderletter.convertngrampCheckpreCheckstandardStatsstrsplit2study.charactertext2numtext2sentencesvectorize.textwhich.term
Dependencies:NLPopenNLPopenNLPdatarJava
Readme and manuals
Help Manual
Help page | Topics |
---|---|
allStats | allStats |
est.ss | est.ss |
get.abstract | get.abstract |
get.aff | get.aff |
get.alpha.error | get.alpha.error |
get.assumptions | get.assumptions |
get.author | get.author |
get.category | get.category |
get.country | get.country |
get.doi | get.doi |
get.editor | get.editor |
get.history | get.history |
get.journal | get.journal |
get.keywords | get.keywords |
get.method | get.method |
get.multi.comparison | get.multi.comparison |
get.n.studies | get.n.studies |
get.outlier.def | get.outlier.def |
get.power | get.power |
get.R.package | get.R.package |
get.references | get.references |
get.sig.adjectives | get.sig.adjectives |
get.software | get.software |
get.stats | get.stats |
get.subject | get.subject |
get.tables | get.tables |
get.test.direction | get.test.direction |
get.text | get.text |
get.title | get.title |
get.type | get.type |
get.vol | get.vol |
grep2 | grep2 |
has.interaction | has.interaction |
JATSdecoder | JATSdecoder |
letter.convert | letter.convert |
ngram | ngram |
pCheck | pCheck |
standardStats | standardStats |
strsplit2 | strsplit2 |
study.character | study.character |
text2num | text2num |
text2sentences | text2sentences |
vectorize.text | vectorize.text |
which.term | which.term |