awesome-akash/Falcon-7B at master · andy108369/awesome-akash

Dockerfile
deploy.yml
falcon7b.py
index.html
readme.md

readme.md

To quote the creator:

Falcon-7B-Instruct is a 7B parameters causal decoder-only model built by TII based on Falcon-7B and finetuned on a mixture of chat/instruct datasets. It is made available under the Apache 2.0 license.

Why use Falcon-7B-Instruct?

You are looking for a ready-to-use chat/instruct model based on Falcon-7B. Falcon-7B is a strong base model, outperforming comparable open-source models (e.g., MPT-7B, StableLM, RedPajama etc.), thanks to being trained on 1,500B tokens of RefinedWeb enhanced with curated corpora. See the OpenLLM Leaderboard. It features an architecture optimized for inference, with FlashAttention (Dao et al., 2022) and multiquery (Shazeer et al., 2019).

Originally found on huggingface.co: https://huggingface.co/tiiuae/falcon-7b-instruct

This deployment is typically suited to GPUs with more than 16 GB of VRAM. A demonstration of the model running on the Akash network, on an NVIDIA A100 and an NVIDIA RTX 3090, can be found here: https://www.youtube.com/watch?v=1wL0cIbQ2x4&t=1s&pp=ygUJZXVyb3Bsb3Rz
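As a rough illustration of where the 16 GB guideline may come from, the sketch below estimates the memory taken by the weights alone. The nominal parameter count of 7e9 and the bytes-per-parameter table are assumptions for illustration, not figures from this repository; real usage also includes activations, the attention cache, and framework overhead.

```python
# Back-of-the-envelope estimate of weight memory.
# ASSUMPTIONS: nominal 7e9 parameters and the dtype sizes below;
# neither value is taken from this repository.
BYTES_PER_PARAM = {"float32": 4, "bfloat16": 2, "int8": 1}


def weight_memory_gib(num_params: float, dtype: str) -> float:
    """Return the memory needed for the weights alone, in GiB."""
    return num_params * BYTES_PER_PARAM[dtype] / 2**30


if __name__ == "__main__":
    nominal_params = 7e9  # nominal "7B"; the exact count may differ
    for dtype in BYTES_PER_PARAM:
        gib = weight_memory_gib(nominal_params, dtype)
        print(f"{dtype}: {gib:.1f} GiB for weights only")
```

Under these assumptions, 16-bit weights alone come to about 13 GiB, which leaves little headroom on a 16 GB card once inference overhead is added.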

The model requires a download of roughly 10 GB, so please give the deployment time to start.
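To get a feel for how long the startup might take, the following sketch converts a download size and a link speed into a transfer time. The bandwidth figures are hypothetical examples; the actual speed depends on the provider hosting the deployment.

```python
# Rough download-time estimate.
# ASSUMPTIONS: decimal gigabytes and a steady link speed; the bandwidth
# values in the example are hypothetical, not measured on Akash.
def download_seconds(size_gb: float, bandwidth_mbit_s: float) -> float:
    """Return the seconds needed to fetch size_gb at bandwidth_mbit_s."""
    size_megabits = size_gb * 1000 * 8  # GB -> MB -> megabits
    return size_megabits / bandwidth_mbit_s


if __name__ == "__main__":
    for mbit in (100, 1000):
        minutes = download_seconds(10, mbit) / 60
        print(f"10 GB at {mbit} Mbit/s: about {minutes:.1f} minutes")
```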

TODO: make it possible to continue the conversation
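One possible way to approach this TODO is to keep the earlier turns and prepend them to each new prompt. The sketch below is only an illustration: the `Conversation` class, the `max_turns` limit, and the `User:` / `Assistant:` prompt layout are all hypothetical and are not taken from `falcon7b.py` or from the model's documentation.

```python
from dataclasses import dataclass, field


@dataclass
class Conversation:
    """Hypothetical helper that keeps recent turns for prompt building."""

    max_turns: int = 4  # assumed limit to keep the prompt short
    turns: list = field(default_factory=list)

    def add(self, user: str, assistant: str) -> None:
        """Record one completed exchange, dropping the oldest if needed."""
        self.turns.append((user, assistant))
        self.turns = self.turns[-self.max_turns:]

    def build_prompt(self, new_message: str) -> str:
        """Join the stored turns and the new message into one prompt."""
        lines = []
        for user, assistant in self.turns:
            lines.append(f"User: {user}")
            lines.append(f"Assistant: {assistant}")
        lines.append(f"User: {new_message}")
        lines.append("Assistant:")  # the model would continue from here
        return "\n".join(lines)


if __name__ == "__main__":
    conv = Conversation()
    conv.add("Hi", "Hello!")
    print(conv.build_prompt("How are you?"))
```

The resulting string would then be passed to whatever generation call the application already uses, and the model's reply stored with `add` before the next turn.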
