Happy New Year!
At the end of 2023, I started mentioning ChatGPT.
Merry Christimas! ... Also my thoughts about GPT
To show what makes GPT models so special, it would be interesting to give a basic introduction to large language models (LLMs) in general, focusing on building simple LLMs using T5 models.
You may be wondering "What is LLM?", "What are T5 models?", "You talked about GPT last month, so why use T5 models instead?"
What is LLM?
What are T5-models?
You talked about GPT last month, so why use T5 models instead?
- Intuitively easier to understand LLM concepts: I'll write about this in more details in future blog posts, but building T5 models require a basic understanding to how LLM reads, understands, and generates human text. T5 models are SUPER easy to build so you can have the satisfaction of building your own AI while also learning how they work at the same time! GPT models are very easy to use but most of how the LLM generates text isn't made apparent. (GPT models are really cool in this aspect but more on this in the future.)
- Cost: OpenAI gives an initial $5 credit to use OpenAI models, including GPT models, but after that's spent we have to pay to use them. T5 models are completely free to build!
- Simple coding: Basic T5 models only require a few lines of code!