Show HN: finetune LLMs via the Finetuning Hub https://ift.tt/Ml3U7qJ

Show HN: finetune LLMs via the Finetuning Hub Hi HN community, I have been working on benchmarking publicly available LLMs these past couple of weeks. More precisely, I am interested on the finetuning piece since a lot of businesses are starting to entertain the idea of self-hosting LLMs trained on their proprietary data rather than relying on third party APIs. To this point, I am tracking the following 4 pillars of evaluation that businesses are typically look into: - Performance - Time to train an LLM - Cost to train an LLM - Inference (throughput / latency / cost per token) For each LLM, my aim is to benchmark them for popular tasks, i.e., classification and summarization. Moreover, I would like to compare them against each other. So far, I have benchmarked Flan-T5-Large, Falcon-7B and RedPajama and have found them to be very efficient in low-data situations, i.e., when there are very few annotated samples. Llama2-7B/13B and Writer’s Palmyra are in the pipeline. But there’s so many LLMs out there! In case this work interests you, would be great to join forces. GitHub repo attached — feedback is always welcome :) Happy hacking! https://ift.tt/rRfYv20 September 4, 2023 at 05:16AM

Show HN: finetune LLMs via the Finetuning Hub https://ift.tt/Ml3U7qJ

You may like these posts

Post a Comment

0 Comments

Main Menu

Social Counter

Social Media Icons 2

Menu

Post Top Ad

Search This Blog

Archive

Post Top Ad

Social Media Icons

Post Top Ad

Author Details

Latest Posts

Comments

Social Media Icons

Breaking

Featured Posts

Recent in Sports

Music

Text Widget

Sample Text

About Me

Send Quick Message

Menu

Gallery

Recent News

About Sure Mag

Pages

Comment

Mobile Logo Settings

Social Media Icons

Main Menu

Popular

Social Plugin

Popular Posts

Subscribe Us

Facebook

Categories

Recent Posts

Categories

Tags

Recent in Recipes

Menu Footer Widget