To train the models, we use the run_train.py CLI script. The script supports various arguments. See the training/README.md file for more information, along with the hyperparameters used in the paper.
Phi-3-MLX is a versatile AI framework that leverages both the Phi-3-Vision multimodal model and the Phi-3-Mini-128K language model, optimized for Apple Silicon using the MLX framework. This project ...
Abstract: The application of Large Language Models (LLMs) has been a deeply explored topic, but with little focus on utilizing LLMs for predicting ICD-10 patient diagnoses in the medical field. Using ...