Differences

This shows you the differences between two versions of the page.

--- tutorials:ai_llm_tools [2025/06/09 14:48] – [What?] chkuo
+++ tutorials:ai_llm_tools [2025/06/12 17:40] (current) – [Value] chkuo
@@ Line 2: / Line 2: @@
 ===== About =====
-  * A brief guide to using large language models (LLMs), such as ChatGPT by OpenAI and Gemini by Google, which are often referred to as artificial intelligence (AI) tools, in the context of academic research. By Chih-Horng Kuo (chk@gate.sinica.edu.tw). Suggestions are welcome.
+  * A brief guide to using large language models (LLMs) often referred to as artificial intelligence (AI) tools, such as ChatGPT by OpenAI and Gemini by Google, in the context of academic research and writing. By Chih-Horng Kuo (chk@gate.sinica.edu.tw). Suggestions are welcome.
   * Target audience: undergraduate students, graduate students, and postdocs (in biology)
 ===== Preface =====
@@ Line 10: / Line 11: @@
     * As the user, you are responsible for the final output of your work
     * Include proper usage statements
-  * For personal use: It is possible and potentially beneficial to use LLMs for emotional support, but should not be mistaken for true care or therapeutic depth. Be careful of potential misuse and harm.
+  * For personal use:
+    * Emotional support can be valuable and beneficial; graduate school (or life) is hard!
+    * Should not be mistaken for true care or therapeutic depth
+    * Be careful of potential misuse and harm
 ===== LLMs Explained =====
-==== What? ====
+==== What are LLMs? ====
-  * Artificial intelligence systems trained on vast amounts of text data to understand and generate human language
+  * Artificial intelligence systems trained on large collections of text to generate human-like responses
-    * "Predicting the next word in a sequence based on the training"; this is not "real intelligence"
+    * Predict the next word (token) based on statistical patterns
-    * Do not "think" like humans do
+    * Do not “think” or “understand” like humans
+    * Mimic intelligence, but are not truly “intelligent”; can be very convincing!
   * Key concept
-    * Language model (vs. database): probability relationships among words (token)
+    * LLMs work by estimating probability relationships among tokens, not by indexing content like a database such as Wikipedia
+  * Key terms
     * Token: basic unit for LLMs to process information
     * Prompt: user input
-    * Session: user input + LLM output, combined to generate text
+    * Session: a series of user input and LLM output
-    * Memory
+      * Good practice: limit the scope and length of individual sessions
+    * Memory: information kept from previous interactions; some systems provide this function
-==== Limitations and Failure Modes ====
-  * LLMs cannot assess the quality of information sources. As a result, they cannot reliably distinguish between high- and low-quality content or apply appropriate weighting.
+==== Value ====
-      * Academia: "good" vs. "bad" papers
+  * Always-available conversation partners
-      * Outside of academia: biased information from paid advertising and other influence
+    * Often, just having a conversation helps you think much more clearly
-	* LLMs do not know what they do not know; may make up false info (AI hallucination)
+    * Particularly helpful in the context of emotional support
-  * Bias in training data set; language
+  * In some ways, LLMs are like crystals of human knowledge
+  * Useful for learning new topics efficiently through interactive Q&A
+  * Perform especially well on topics with abundant, high-quality training data
+  * Highly proficient in language, particularly useful for clear and precise expression of ideas
+  * Great tools for self-improvement
+    * How much can LLMs be trusted? I don't know
+    * Have I successfully used LLMs to achieve deeper thinking and clearer writing? YES!
+    * What's the catch? Time and effort
+==== Limitations ====
+  * Limitations and biases exist in both the training data and the model development process
+    * English is the dominant language
+    * Often optimized to be agreeable and non-confrontational
+    * Limitations of human knowledge and available text data sets
+  * LLMs cannot assess the quality of sources
+    * Cannot reliably distinguish between high- and low-quality content
+    * Cannot apply appropriate weighting
+    * Academia: "good" vs. "bad" papers
+    * Outside of academia: biased information from paid advertising and other influence
+  * LLMs do not know what they do not know; may make up false info (AI hallucination)
   * For scientific writing
-    * May not be suitable to process novel findings; summarizing and explaining existing knowledge may work better
+    * Summarizing and explaining existing knowledge often work well
+    * May not be suitable for processing novel findings
+==== Common Failure Modes ====
+  * Superficial content
+  * Oversimplification
+  * Overstatement
+  * False coherence
+  * Hallucination; fabricated web links, DOIs, and PubMed IDs
+  * Non-specific terminology for highly specific topics
-==== Failure Modes ====
-  * Common issues: non-specific terminology, superficial, overstatement, false coherence, hallucination
 ===== Core Principles for Usage =====
   * Use LLMs as tools to sharpen your thinking, not black boxes for quick answers
-  * Using LLMs does not necessarily save time; more useful to use LLMs to deepen your thinking process (which actually takes more time)
+  * Do not outsource your thinking process!
-  * Good practice involves: iterate, reflect, calibrate, and finalize
+    * Use LLMs in your thinking and learning: you become better
-    * Start with a solid frame of reference. Provide sufficient info for the task, as well as your requirement and expectation for the LLM.
+    * Use LLMs to think for you: you skip the practice, bad for learning
+  * LLMs may not save time
+    * In fact, LLMs are most valuable when used to deepen the thinking process
+    * Explore possibilities, compare alternatives, and refine nuances all require time and effort, but the process and end product can be rewarding
+    * Example: You are deciding the title of your manuscript. You provided the abstract, do you ask LLMs to:
+      * (1) suggest a title, or
+      * (2) suggest three titles, compare the emphasis and tone, refine to express accurately, adjust based on the target audience
+  * LLMs act like mirrors; their responses reflect your input and framing
+    * Trained to be agreeable, they often conform to both your explicit instructions and implicit tone
+    * Helpful to make your implicit thoughts become explicit and better organized
+  * Suggested practice
+    * Start with a solid frame of reference
+      * Provide clear context for the task, including your goals and expectations
       * What exactly do you want to obtain?
-      * How do you want to LLM to act? Is your expectation realistic? For example, these are possible but differ in feasibility and reliability: polishing arguments, organize thoughts, brainstorm new ideas, identify major gaps in arguments, and critical review.
+      * How do you want the LLM to act?
-    * Assess the output, make decision, and provide feedback
+      * Are your expectations realistic?
-    * Iterative process; may involve gradual refinement and/or drastic changes
+      * Examples of possible uses (varying in feasibility and reliability):
-    * Knowing when to stop is important
+        * Polish arguments
+        * Organize thoughts
+        * Brainstorm new ideas
+        * Identify major gaps in reasoning
+        * Provide critical review
+    * Assess the output, make decisions, and provide feedback
+    * Use an iterative process; may involve gradual refinement or drastic changes
+    * Fact check, fact check, and fact check!!!
+    * Know when to stop; did the last iteration provide meaningful improvement?
 ===== Human–AI Dynamics =====
-  * By design, LLMs mirror your tone and reinforce your framing
+  * LLMs mirror your tone, assumptions, and style
-  * The output can become too agreeable
+  * Output can become overly agreeable, especially in long sessions
-  * Asking LLMs to role-play can be useful, but mostly because the process can stimulate YOUR thinking, not because LLM in different roles truly have different expertises
+    * It is natural to feel defensive when an LLM points out flaws in your argument
+    * However, LLMs are easily swayed by your rebuttals
+    * Reinforcing your own biases is not useful
+  * Asking LLMs to role-play can be useful
+    * Not because the roles provide true expertise, but because the exercise can stimulate YOUR thinking
+    * For example, asking an LLM to act as a qualifying committee member or manuscript reviewer may help you prepare
+    * In these roles, LLMs can mimic tone and style, but they lack the knowledge, logic, and Judgment of human experts
 ===== Use Cases =====
-  * "Deep Research" function (available in both ChatGPT & Gemini) for targeted search of PubMed and summary
+  * Writing and obtaining feedback as a way of thinking; force LLMs to criticize and challenge your thoughts
+  * Use the "Deep Research" function (available in both ChatGPT & Gemini) for targeted search of PubMed and follow-up summary
   * Reading dense papers
-  * Build custom database of selected references using NotebookLM by Google
+  * Build custom databases of selected references using NotebookLM by Google
   * Grammar check and copy editing
+  * Prepare for discussion with seminar speakers; info about you and the speaker as the input, suggestions of questions to ask as the output
   * Drafting or refining emails (with tone sensitivity)
-  * To prepare for individual meeting with seminar speakers. Given your interest and the speaker's expertise, suggest a few questions
+  * Technology related, such as server/intranet setup, hardware purchase planning, Linux/NAS configuration, or organization of computer files and directories
-  * Technology related, such as server/intranet setup, hardware purchase planning, Linux/NAS configuration, organization of computer files and directories
-===== Other Notable Points =====
-  * Test the tool with something you know first
+===== Additional Points =====
+  * Test the tool with topics that you know well first for evaluation
+  * Push boundaries, learn the situations that LLMs are useful (or not)
+  * Different tools for different uses
+    * ChatGPT is chatty; highly conversational, discussions can take unexpected turns, great for exploration
+    * Gemini integrates Google Search; useful for questions with clear answers and you want the web sources
+  * Learn to judge output, know when to reject the suggestions
   * Over time, thoughtful use of LLMs can improve your writing, reflection, and decision-making skills
   * AI/LLM tools are under intensive development, different tools may perform better at different tasks; develop your own taste and workflow, keep an eye on the development
-  * In future, AI/LLM may be useful "simulated advisor"?