Fupi - Thanks and Credits

My beloved Argeia, thank you so much!
Your patience with my computing studies is my greatest help!

Many thanks to my friend and colleague Adam Fauzi, who joined me on my journey with Fupi and helped me, among other things, with the selection and evaluation of machine learning models!

LanceDB

https://github.com/lancedb/lancedb
https://lancedb.github.io/lance/index.html
https://lancedb.github.io/lancedb/

Models

https://huggingface.co/intfloat/multilingual-e5-large
https://huggingface.co/Qdrant/multilingual-e5-large-onnx
https://huggingface.co/BAAI/bge-m3
https://huggingface.co/aapot/bge-m3-onnx
https://huggingface.co/ddmitov/bge_m3_dense_colbert_onnx
https://huggingface.co/michaelfeil/ct2fast-m2m100_418M

ONNX

https://onnx.ai/

https://onnxruntime.ai/
https://onnxruntime.ai/docs/execution-providers/
https://onnxruntime.ai/docs/performance/tune-performance/threading.html

https://huggingface.co/docs/transformers/serialization
https://huggingface.co/docs/optimum/v1.2.1/en/onnxruntime/modeling_ort
https://huggingface.co/docs/optimum/onnxruntime/usage_guides/pipelines

Datasets

https://www.kaggle.com/datasets/jbencina/department-of-justice-20092018-press-releases
https://huggingface.co/datasets/CloverSearch/cc-news-mutlilingual
https://commoncrawl.org/blog/news-dataset-available

Papers

https://arxiv.org/abs/2212.03533
https://arxiv.org/abs/2402.05672

Stack Overflow

https://stackoverflow.com/questions/65419499/download-pre-trained-sentence-transformers-model-locally
https://stackoverflow.com/questions/62134409/how-to-compile-torch-1-5-0-without-gpu-support
https://stackoverflow.com/questions/18714587/how-to-calculate-centroid-in-python
https://stackoverflow.com/questions/2564137/how-to-terminate-a-thread-when-main-program-ends
https://stackoverflow.com/questions/35795452/checking-if-a-list-of-files-exists-before-proceeding
https://stackoverflow.com/questions/3459098/create-list-of-single-item-repeated-n-times
https://stackoverflow.com/questions/40697845/what-is-a-good-practice-to-check-if-an-environment-variable-exists-or-not

Real Python

https://realpython.com/python-type-hints-multiple-types/

Bash

https://linuxize.com/post/bash-write-to-file/

Fly.io

https://fly.io/docs/apps/
https://fly.io/docs/reference/configuration/
https://fly.io/docs/machines/runtime-environment/
https://fly.io/docs/apps/volume-storage/
https://fly.io/docs/hands-on/install-flyctl/

Cloudflare R2

https://developers.cloudflare.com/r2/examples/rclone/

MinIO

https://min.io/docs/minio/linux/developers/python/API.html
https://min.io/docs/minio/linux/reference/minio-mc/mc-alias-set.html
https://min.io/docs/minio/linux/reference/minio-mc/mc-cp.html

rclone

https://rclone.org/install/
https://rclone.org/flags/

Markdown

https://gist.github.com/DavidWells/7d2e0e1bc78f4ac59a123ddf8b74932d

Icon

https://www.svgrepo.com/svg/405198/giraffe