Instructions to use pucpr-br/Clinical-BR-Mistral-7B-v0.2 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use pucpr-br/Clinical-BR-Mistral-7B-v0.2 with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="pucpr-br/Clinical-BR-Mistral-7B-v0.2")

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("pucpr-br/Clinical-BR-Mistral-7B-v0.2")
model = AutoModelForCausalLM.from_pretrained("pucpr-br/Clinical-BR-Mistral-7B-v0.2")

Notebooks
Google Colab
Kaggle
Local Apps

vLLM

How to use pucpr-br/Clinical-BR-Mistral-7B-v0.2 with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "pucpr-br/Clinical-BR-Mistral-7B-v0.2"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "pucpr-br/Clinical-BR-Mistral-7B-v0.2",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker

docker model run hf.co/pucpr-br/Clinical-BR-Mistral-7B-v0.2

SGLang

How to use pucpr-br/Clinical-BR-Mistral-7B-v0.2 with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "pucpr-br/Clinical-BR-Mistral-7B-v0.2" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "pucpr-br/Clinical-BR-Mistral-7B-v0.2",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "pucpr-br/Clinical-BR-Mistral-7B-v0.2" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "pucpr-br/Clinical-BR-Mistral-7B-v0.2",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Docker Model Runner
How to use pucpr-br/Clinical-BR-Mistral-7B-v0.2 with Docker Model Runner:
```
docker model run hf.co/pucpr-br/Clinical-BR-Mistral-7B-v0.2
```
Browse Quantizations to use this model in llama.cpp, Ollama, LM Studio, or any compatible app.

MED-LLM-BR: Medical Large Language Models for Brazilian Portuguese

MED-LLM-BR is a collaborative project between HAILab and Comsentimento, which aims to develop multiple medical LLMs for Portuguese language, including base models and task-specific models, with different sizes.

Introduction

Clinical-BR-Mistral-7B-v0.2 is a fine-tuned language model specifically designed for generating clinical notes in Portuguese. This model builds on the strengths of Mistral 7B, adapting it through targeted fine-tuning techniques to meet the unique demands of clinical text generation. By focusing on the nuances and complexities of medical language in Portuguese, Clinical-BR-Mistral-7B-v0.2 aims to support healthcare professionals with contextually accurate and relevant clinical documentation.

Fine-Tuning Approach

To enhance memory efficiency and reduce computational demands, we implemented LoRA with 16-bit precision on the q_proj and v_proj projections. We configured LoRA with R set to 8, Alpha to 16, and Dropout to 0.1, allowing the model to adapt effectively while preserving output quality. For optimization, the AdamW optimizer was used with parameters β1 = 0.9 and β2 = 0.999, achieving a balance between fast convergence and training stability. This careful tuning process ensures robust performance in generating accurate and contextually appropriate clinical text in Portuguese.

Data

The fine-tuning of Clinical-BR-Mistral-7B-v0.2 utilized 2.4GB of text from three clinical datasets. The SemClinBr project provided diverse clinical narratives from Brazilian hospitals, while the BRATECA dataset contributed admission notes from various departments in 10 hospitals. Additionally, data from Lopes et al., 2019, added neurology-focused texts from European Portuguese medical journals. These datasets collectively improved the model’s ability to generate accurate clinical notes in Portuguese.

Provisional Citation:

@inproceedings{pinto2024clinicalLLMs,
  title        = {Developing Resource-Efficient Clinical LLMs for Brazilian Portuguese},
  author       = {João Gabriel de Souza Pinto and Andrey Rodrigues de Freitas and Anderson Carlos Gomes Martins and Caroline Midori Rozza Sawazaki and Caroline Vidal and Lucas Emanuel Silva e Oliveira},
  booktitle    = {Proceedings of the 34th Brazilian Conference on Intelligent Systems (BRACIS)},
  year         = {2024},
  note         = {In press},
}

Downloads last month: 18,974

Safetensors

Model size

7B params

Tensor type

BF16

Model tree for pucpr-br/Clinical-BR-Mistral-7B-v0.2

Base model

mistralai/Mistral-7B-v0.3

Finetuned

(386)

this model

Quantizations

1 model