This project is mirrored from https://github.com/Mintplex-Labs/anything-llm.
- Dec 15, 2023
-
-
timothycarambat authored
linting
-
Timothy Carambat authored
* feat: Embed on-instance Whisper model for audio/mp4 transcribing
  resolves #329
* additional logging
* add placeholder for tmp folder in collector storage
  Add cleanup of hotdir and tmp on collector boot to prevent hanging files
  split loading of model and file conversion into concurrency
* update README
* update model size
* update supported filetypes
-
- Dec 14, 2023
-
-
Timothy Carambat authored
* wip: init refactor of document processor to JS
* add NodeJs PDF support
* wip: parity with python processor
  feat: add pptx support
* fix: forgot files
* Remove python scripts totally
* wip: update docker to boot new collector
* add package.json support
* update dockerfile for new build
* update gitignore and linting
* add more protections on file lookup
* update package.json
* test build
* update docker commands to use cap-add=SYS_ADMIN so web scraper can run
  update all scripts to reflect this
  remove docker build for branch
-
- Dec 13, 2023
-
-
Timothy Carambat authored
resolves #298
-
- Dec 11, 2023
-
-
Timothy Carambat authored
resolves #418
-
- Nov 17, 2023
-
-
timothycarambat authored
-
Sean Hatfield authored
* WIP adding url uploads to document picker
* fix manual script for uploading url to custom-documents
* fix metadata for url scraping
* wip url parsing
* update how async link scraping works
* docker-compose defaults added
  no autocomplete on URLs
---------
Co-authored-by: timothycarambat <rambat1010@gmail.com>
-
- Nov 16, 2023
-
-
Sean Hatfield authored
add id to all metadata to prevent errors in frontend document picker
Co-authored-by: timothycarambat <rambat1010@gmail.com>
-
timothycarambat authored
-
- Nov 15, 2023
-
-
timothycarambat authored
-
- Nov 01, 2023
-
-
Timothy Carambat authored
-
- Oct 30, 2023
-
-
Francisco Bischoff authored
* mbox parsing improvements v1
* autobots roll out!
-
- Sep 29, 2023
-
-
Timothy Carambat authored
resolves #257
-
- Sep 18, 2023
-
-
Timothy Carambat authored
pass messages to frontend on success/failure
resolves #242
-
Timothy Carambat authored
* cosmetic changes to be compatible with hadolint
* common configuration for most editors until better plugins come up
* Changes on PDF metadata, using PyMuPDF (faster and more compatible)
* small changes on other file ingestions in order to try to keep the fields equal
* Lint, review, and review
* fixed unknown chars
* Use PyMuPDF for pdf loading for 200% speed increase
  linting
---------
Co-authored-by: Francisco Bischoff <franzbischoff@gmail.com>
Co-authored-by: Francisco Bischoff <984592+franzbischoff@users.noreply.github.com>
-
- Aug 14, 2023
-
-
Melroy van den Berg authored
Upgrade fake-useragent to latest version (v1.2.1). Disclaimer: I'm the package maintainer.
-
- Aug 04, 2023
-
-
Timothy Carambat authored
append time value to folder name to prevent duplicate uploads
-
- Jul 06, 2023
-
-
AntonioCiolino authored
* .
* twitter feature update
* Key validation and operation
-
- Jul 05, 2023
-
-
Timothy Carambat authored
-
- Jun 26, 2023
-
-
mplawner authored
* Update filetypes.py
  Added mbox format
* Created new file
  Added support for mbox files as used by many email services, including Google Takeout's Gmail archive.
* Update filetypes.py
* Update as_mbox.py
-
- Jun 19, 2023
-
-
AntonioCiolino authored
* Enable web scraping based on a URL and a simple filter.
* ignore yarn
* Updated Link scraper to avoid NoneType error.
-
- Jun 18, 2023
-
-
frasergr authored
-
- Jun 17, 2023
-
-
AntonioCiolino authored
-
timothycarambat authored
-
- Jun 16, 2023
-
-
Timothy Carambat authored
* implement dnd uploader
  show file upload progress
  write files to hot directory
  build simple Flask API to process files one off
* move document processor calls to util
  build out dockerfile to run both procs at the same time
  update UI to check for document processor before upload
* disable pragma update on boot
* dockerfile changes
* add filetype restrictions based on python app support response and show rejected files in the UI
* cleanup
* stub migrations on boot to prevent exit condition
* update CF template for AWS deploy
-
AntonioCiolino authored
fixed typo for text.
-
- Jun 15, 2023
-
-
Skid Vis authored
* Adds ability to import sitemaps to include a website
* adds example sitemap url
* adds filter to bypass common image formats
* moves filetype ignoring to sitemap script
-
- Jun 14, 2023
-
-
Skid Vis authored
* Adds ability to import sitemaps to include a website
* adds example sitemap url
-
- Jun 13, 2023
-
-
frasergr authored
* Updates for Linux for frontend/server
* frontend/server docker
* updated Dockerfile for deps related to node vectordb
* updates for collector in docker
* docker deps for ODT processing
* ignore another collector dir
* storage mount improvements; run as UID
* fix pypandoc version typo
* permissions fixes
-
- Jun 12, 2023
-
-
Fabio authored
Co-authored-by: Carvalho, Fabio <Fabio_Carvalho@comcast.com>
-
- Jun 09, 2023
-
-
timothycarambat authored
-
- Jun 04, 2023
-
-
timothycarambat authored
-