Differences
This shows you the differences between two versions of the page.
tutorials:file_management [2023/08/20 23:43] – chkuo | tutorials:file_management [2025/06/07 02:38] (current) – [File Management] chkuo | ||
---|---|---|---|
Line 1: | Line 1: | ||
====== File Management ====== | ====== File Management ====== | ||
- | + | | |
- | | + | * Related information: |
- | * Applicable to [[tutorials: | + | * [[tutorials: |
+ | * [[tutorials: | ||
===== Backup ===== | ===== Backup ===== | ||
* <color # | * <color # | ||
- | * You will experience file loss. Not __IF__, but __WHEN__. | + | * File loss happens. Not __IF__, but __WHEN__. |
* Establish an automatic backup plan; manual backup plans are not practical | * Establish an automatic backup plan; manual backup plans are not practical | ||
- | | + | |
- | * Linux: rsync scripts | + | * Linux: |
===== Working with collaborators ===== | ===== Working with collaborators ===== | ||
* Discuss and set up a workflow | * Discuss and set up a workflow | ||
* Avoid emailing files as attachments; | * Avoid emailing files as attachments; | ||
- | * Pros and cons of setting | + | * Setting |
+ | * File server: | ||
+ | * Pros: Full control, large storage, no vendor lock-in | ||
+ | * Cons: Requires hardware, setup, and maintenance | ||
+ | * File sharing service | ||
+ | * Pros: Easy, low cost | ||
+ | * Cons: Limited space, less control over structure, potential data leakage | ||
===== File directory structure ===== | ===== File directory structure ===== | ||
* Make it easy to identify the parts that need to be included in backup | * Make it easy to identify the parts that need to be included in backup | ||
* Example for personal desktops/ | * Example for personal desktops/ | ||
+ | * A few top-level folders with clear naming for easy backup/ | ||
* Example for shared servers | * Example for shared servers | ||
+ | * Depends on the research group, for example: data/, lab_doc/, project/, conference/ | ||
+ | * Clear rules, communication, | ||
===== Naming and version control ===== | ===== Naming and version control ===== | ||
* File names | * File names | ||
* Do: short and informative. Include information such as project id, keyword (e.g. manuscript, report, keyword of figure/ | * Do: short and informative. Include information such as project id, keyword (e.g. manuscript, report, keyword of figure/ | ||
- | * Avoid: | + | * Avoid: |
- | * Avoid: short names that are not informative (e.g., manuscript.docx, | + | * Long names that are difficult to read (e.g., full title of the manuscript) |
- | * Avoid: space or special characters | + | * Short names that are not informative (e.g., manuscript.docx, |
- | * Avoid: | + | * Space or special characters |
+ | * Names such as " | ||
* Versioning | * Versioning | ||
* Version number should be the last part of the file name; two digits should be sufficient | * Version number should be the last part of the file name; two digits should be sufficient | ||
Line 57: | Line 67: | ||
* After finalizing the order, put all early versions into a separate folder (e.g., " | * After finalizing the order, put all early versions into a separate folder (e.g., " | ||
* If the order changed, put the previous versions into another separate folder (e.g., " | * If the order changed, put the previous versions into another separate folder (e.g., " | ||
+ | * Always save the files in editable formats (e.g., .ai, not just .jpg) | ||
===== Tracking changes ===== | ===== Tracking changes ===== | ||
- | * For Word files: | + | * For Word files: |
- | * For Google Docs: all changes are automatically tracked. Manually | + | * For Google Docs: All changes are automatically tracked. Manually download the major versions |
+ | ===== Metadata ===== | ||
+ | * Metadata = data that explains your files: what they are, where they came from, how to use them. | ||
+ | * Examples | ||
+ | * For the raw data file of a gel image (20250509_1428.jpg), | ||
+ | * For a set of raw data files (e.g., Sanger sequencing results), or a file folder, provide a " | ||
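
The naming and versioning rules above (no spaces or special characters, a two-digit version number as the last part of the file name) can be checked and applied with a small script. The following is a minimal sketch, not part of the original tutorial. It assumes a hypothetical `_vNN` suffix placed just before the extension, and it assumes "safe" means letters, digits, underscores, hyphens, and dots only; the function names `is_safe_name` and `next_version` are also illustrative. Adjust the patterns to match the convention your group agrees on.

```python
import re
from pathlib import PurePath

# Assumption: the version is written as "_vNN" (two digits) at the end of the stem.
VERSION_RE = re.compile(r"(?P<base>.+)_v(?P<num>\d{2})")
# Assumption: a "safe" name uses only letters, digits, underscore, hyphen, and dot.
SAFE_RE = re.compile(r"[A-Za-z0-9._-]+")


def is_safe_name(filename: str) -> bool:
    """Return True if the name contains no spaces or special characters."""
    return SAFE_RE.fullmatch(filename) is not None


def next_version(filename: str) -> str:
    """Return the file name with its two-digit version number increased by one."""
    path = PurePath(filename)
    match = VERSION_RE.fullmatch(path.stem)
    if match is None:
        raise ValueError(f"no two-digit version suffix in {filename!r}")
    number = int(match.group("num")) + 1
    if number > 99:
        raise ValueError("two-digit version numbers are exhausted")
    # Keep the base name and extension, and zero-pad the new version number.
    return f"{match.group('base')}_v{number:02d}{path.suffix}"


if __name__ == "__main__":
    print(is_safe_name("p01_manuscript_v03.docx"))  # True
    print(is_safe_name("my manuscript (final).docx"))  # False
    print(next_version("p01_manuscript_v03.docx"))  # p01_manuscript_v04.docx
```

A name without a version suffix (e.g., manuscript.docx) raises a `ValueError` instead of being renamed silently, so files that do not follow the convention are noticed early.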
tutorials/file_management.1692546219.txt.gz · Last modified: by chkuo