|
ChatGPT - otkrijte magiju Velikih Jezičkih Modela
ChatGPT: Discover the magic behind Large Language Models
Sažetak
Ovaj rad predstavlja pristupačan uvod u Velike Jezičke Modele (VJM) za inženjerske stručnjake izvan domena softverskog inženjerstva. Povlačeći paralele između ljudske kognicije i veštačkih neuronskih mreža, objašnjavamo fundamentalne koncepte koji stoje iza VJM-a, njihov proces treniranja i operativne karakteristike. Cilj rada je da demistifikuje ove kompleksne sisteme kroz pristupačne analogije i praktične primere, sa posebnim osvrtom na arhitekturu modela, proces treniranja i primenu u inženjerskoj praksi.
Abstract
This paper presents an accessible introduction to Large Language Models (LLMs) for engineering professionals outside the field of software engineering. By drawing parallels between human cognition and artificial neural networks, we explain the fundamental concepts behind LLMs, their training process, and operational characteristics. The aim of this paper is to demystify these complex systems through relatable analogies and practical examples, with special emphasis on model architecture, training process, and applications in engineering practice.
|