The hardware and bandwidth for this mirror is donated by dogado GmbH, the Webhosting and Full Service-Cloud Provider. Check out our Wordpress Tutorial.
If you wish to report a bug, or if you are interested in having us mirror your free-software or open-source project, please feel free to contact us at mirror[@]dogado.de.
Detects and suggests fixes for semantic inconsistencies in data frames by calling large language models (LLMs) through a unified, provider-agnostic interface. Supported providers include 'OpenAI' ('GPT-4o', 'GPT-4o-mini'), 'Anthropic' ('Claude'), 'Google' ('Gemini'), 'Groq' (free-tier 'LLaMA' and 'Mixtral'), and local 'Ollama' models. The package identifies issues that rule-based tools cannot detect: abbreviation variants, typographic errors, case inconsistencies, and malformed values. Results are returned as tidy data frames with column, row index, detected value, issue type, suggested fix, and confidence score. An offline fallback using statistical and fuzzy-matching methods is provided for use without any API key. Interactive fix application with human review is supported via 'apply_fixes()'. Methods follow de Jonge and van der Loo (2013) <https://cran.r-project.org/doc/contrib/de_Jonge+van_der_Loo-Introduction_to_data_cleaning_with_R.pdf> and Chaudhuri et al. (2003) <doi:10.1145/872757.872796>.
| Version: | 0.1.0 |
| Depends: | R (≥ 4.1.0) |
| Imports: | stats, utils, dplyr (≥ 1.0.0), rlang (≥ 1.0.0) |
| Suggests: | knitr, rmarkdown, testthat (≥ 3.0.0), httr2 (≥ 1.0.0), jsonlite (≥ 1.8.0) |
| Published: | 2026-04-22 |
| DOI: | 10.32614/CRAN.package.llmclean (may not be active yet) |
| Author: | Sadikul Islam |
| Maintainer: | Sadikul Islam <sadikul.islamiasri at gmail.com> |
| License: | GPL-3 |
| NeedsCompilation: | no |
| CRAN checks: | llmclean results |
| Reference manual: | llmclean.html , llmclean.pdf |
| Vignettes: |
LLM-Assisted Data Cleaning with llmclean (source, R code) |
| Package source: | llmclean_0.1.0.tar.gz |
| Windows binaries: | r-devel: not available, r-release: not available, r-oldrel: not available |
| macOS binaries: | r-release (arm64): llmclean_0.1.0.tgz, r-oldrel (arm64): not available, r-release (x86_64): llmclean_0.1.0.tgz, r-oldrel (x86_64): llmclean_0.1.0.tgz |
Please use the canonical form https://CRAN.R-project.org/package=llmclean to link to this page.
These binaries (installable software) and packages are in development.
They may not be fully stable and should be used with caution. We make no claims about them.
Health stats visible at Monitor.