User Tools

Site Tools


tutorials:ai_llm_tools

This is an old revision of the document!


AI/LLM Tools

About

  • A brief guide to using large language models (LLMs) often referred to as artificial intelligence (AI) tools, such as ChatGPT by OpenAI and Gemini by Google, in the context of academic research and writing. By Chih-Horng Kuo (chk@gate.sinica.edu.tw). Suggestions are welcome.
  • Target audience: undergraduate students, graduate students, and postdocs (in biology)

Preface

  • For work: Ethics first!
    • Respect the general rules of academic integrity
    • As the user, you are responsible for the final output of your work
    • Include proper usage statements
  • For personal use: It is possible and potentially beneficial to use LLMs for emotional support, but should not be mistaken for true care or therapeutic depth. Be careful of potential misuse and harm.

LLMs Explained

What are LLMs?

  • Artificial intelligence systems trained on large collections of text to generate human-like responses
    • Predict the next word (token) based on statistical patterns
    • Do not “think” or “understand” like humans
    • Mimic intelligence, but are not truly “intelligent”; can be very convincing
  • Key concept
    • LLMs work by estimating probability relationships among tokens, not by indexing content like a database such as Wikipedia
  • Key terms
    • Token: basic unit for LLMs to process information
    • Prompt: user input
    • Session: a series of user input and LLM output
    • Memory: context retained within a session; some systems support cross-session memory

Value

  • Always-available conversation partner
  • In some ways, LLMs are like crystals of human knowledge
  • Useful for learning new topics efficiently through interactive Q&A
  • Perform especially well on topics with abundant, high-quality training data
  • Highly proficient in language, particularly useful for clear and precise expression of ideas

Limitations

  • Limitations and biases exist in both the training data and the model development process
    • English is the dominant language
    • Often optimized to be agreeable and non-confrontational
    • Limitations of human knowledge and available text data sets
  • LLMs cannot assess the quality of sources
    • Cannot reliably distinguish between high- and low-quality content
    • Cannot apply appropriate weighting
    • Academia: “good” vs. “bad” papers
    • Outside of academia: biased information from paid advertising and other influence
  • LLMs do not know what they do not know; may make up false info (AI hallucination)
  • For scientific writing
    • May not be suitable to process novel findings; summarizing and explaining existing knowledge may work better

Common Failure Modes

  • Non-specific terminology
  • Superficial content
  • Over simplification
  • Overstatement
  • False coherence
  • Hallucination; fabricated web links, DOIs, and PubMed IDs

Core Principles for Usage

  • Use LLMs as tools to sharpen your thinking, not black boxes for quick answers
  • Do not outsource your thinking process!
  • LLMs may not save time
    • In fact, LLMs are most valuable when used to deepen the thinking process
    • Explore possibilities, compare alternatives, and refine nuances all require time and effort, but the process and end product can be rewarding
    • For example: given the abstract, (1) suggest a title, or (2) suggest three titles, compare the emphasis and tone, refine to express accurately, adjust based on the target audience
  • LLMs act like mirrors; their responses reflect your input and framing
    • Trained to be agreeable, they often conform to both your explicit instructions and implicit tone
  • Suggested practice
    • Start with a solid frame of reference
      • Provide clear context for the task, including your goals and expectations
      • What exactly do you want to obtain?
      • How do you want the LLM to act?
      • Are your expectations realistic?
      • Examples of possible uses (varying in feasibility and reliability):
        • Polish arguments
        • Organize thoughts
        • Brainstorm new ideas
        • Identify major gaps in reasoning
        • Provide critical review
    • Assess the output, make decisions, and provide feedback
    • Use an iterative process; may involve gradual refinement or drastic changes
    • Fact check, fact check, and fact check
    • Know when to stop; what did you get in the last iteration?

Human–AI Dynamics

  • LLMs mirror your tone, assumptions, and style
  • Output can become overly agreeable, especially in long sessions
    • It is natural to feel defensive when an LLM points out flaws in your argument
    • However, LLMs are easily swayed by your rebuttals
    • Reinforcing your own biases is not useful
  • Asking LLMs to role-play can be useful
    • Not because the roles provide true expertise, but because the exercise can stimulate YOUR thinking
    • For example, asking an LLM to act as a qualifying committee member or manuscript reviewer may help you prepare
    • In these roles, LLMs can mimic tone and style, but they lack the knowledge, logic, and Judgment of human experts

Use Cases

  • Writing and obtaining feedback as a way of thinking; force LLMs to criticize and challenge your thoughts
  • Use the “Deep Research” function (available in both ChatGPT & Gemini) for targeted search of PubMed and follow-up summary
  • Reading dense papers
  • Build custom databases of selected references using NotebookLM by Google
  • Grammar check and copy editing
  • Prepare for discussion with seminar speakers; info about you and the speaker as the input, suggestions of questions to ask as the output
  • Drafting or refining emails (with tone sensitivity)
  • Technology related, such as server/intranet setup, hardware purchase planning, Linux/NAS configuration, or organization of computer files and directories

Additional Points

  • Test the tool with topics that you know well first for evaluation
  • Push boundaries, learn the situations that LLMs are useful (or not)
  • Different tools for different uses
    • ChatGPT is chatty; highly conversational, discussions can take unexpected turns, great for exploration
    • Gemini integrates Google Search; useful for questions with clear answers and you want the web sources
  • Learn to judge output, know when to reject the suggestions
  • Over time, thoughtful use of LLMs can improve your writing, reflection, and decision-making skills
  • AI/LLM tools are under intensive development, different tools may perform better at different tasks; develop your own taste and workflow, keep an eye on the development
tutorials/ai_llm_tools.1749569066.txt.gz · Last modified: by chkuo