Log: Mini Bioinformatics Project (1/2), Analyzing the HBB Gene

Two months into the MS Bioinformatics program,

I’ve met a lot of great people and learned so much, but I haven’t written any code.

And my good old friend, anxiety, is now screaming.

I know that writing simple code and archiving it is a good cure for this kind of anxiety.

So, I decided to do an itty-bitty, cute little bioinformatics project.

I’m doing it with ChatGPT, and I hope my logs can guide future bioinformaticians who feel lost, just like me.


My first practice project: Analyzing the HBB gene.

It’s a well-studied gene, and one I’m already familiar with.

I’ll calculate the GC content and the base frequencies (A, T, G, C) and visualize them.

You see? It’s super simple :)
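
For anyone wondering what that calculation looks like, here is a minimal sketch in plain Python. It is only an illustration, not the code from my repository: the function names `base_counts` and `gc_content` and the toy sequence are made up for this example, and I'm assuming GC content means (G + C) divided by the total number of A, T, G, and C bases, ignoring ambiguous bases such as N.

```python
from collections import Counter


def base_counts(seq):
    """Count A, T, G and C in a DNA sequence (case-insensitive)."""
    counts = Counter(seq.upper())
    # Report only the four standard bases; anything else (e.g. N) is skipped.
    return {base: counts.get(base, 0) for base in "ATGC"}


def gc_content(seq):
    """Return the fraction of G and C among the A/T/G/C bases."""
    counts = base_counts(seq)
    total = sum(counts.values())
    if total == 0:
        return 0.0  # Empty sequence or no standard bases: avoid dividing by zero.
    return (counts["G"] + counts["C"]) / total


# Toy sequence invented for this example (not the real HBB sequence).
toy_seq = "ATGCGCGTTA"
print(base_counts(toy_seq))
print(f"GC content: {gc_content(toy_seq):.1%}")
```

The dictionary returned by `base_counts` can be handed straight to a plotting library to draw a simple bar chart of the four bases.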


Today, I set up my environment:

✅ Installed Linux OS on a laptop (Ubuntu)
✅ Installed Miniconda and Jupyter Notebook
✅ Set up a Conda virtual environment
✅ Configured SSH authentication to use GitHub

And I wrote code in Jupyter Notebook to:

  1. Download the FASTA file from NCBI.
  2. Print the gene name, length, and first 100 bases.
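
The two steps above can be sketched roughly like this, using only the Python standard library. This is a hedged illustration rather than the actual notebook code (that lives in the repository): `fetch_fasta`, `parse_fasta`, and `summarize` are hypothetical helper names, the download goes through NCBI's E-utilities `efetch` endpoint as one possible route, and the toy record at the bottom is invented so the example runs without a network connection.

```python
from urllib.parse import urlencode
from urllib.request import urlopen

# NCBI E-utilities efetch endpoint (one possible way to download a FASTA record).
EFETCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"


def fetch_fasta(accession, db="nuccore"):
    """Download one record from NCBI as FASTA text (needs network access)."""
    params = urlencode(
        {"db": db, "id": accession, "rettype": "fasta", "retmode": "text"}
    )
    with urlopen(f"{EFETCH_URL}?{params}", timeout=30) as response:
        return response.read().decode("utf-8")


def parse_fasta(text):
    """Return (header, sequence) for the first record in FASTA text."""
    header = None
    seq_lines = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                break  # A second record starts here; stop after the first.
            header = line[1:]
        elif header is not None:
            seq_lines.append(line)
    if header is None:
        raise ValueError("no FASTA header found")
    return header, "".join(seq_lines).upper()


def summarize(text, n=100):
    """Collect the record name, sequence length, and first n bases."""
    header, seq = parse_fasta(text)
    return {"name": header, "length": len(seq), "first_bases": seq[:n]}


# Toy FASTA record invented for this example (not real HBB data).
example = ">toy_record made-up sequence\nATGCGC\nGTTA\n"
info = summarize(example)
print("Name:  ", info["name"])
print("Length:", info["length"])
print("First bases:", info["first_bases"])
```

To try it on real data, replace `example` with the result of `fetch_fasta(...)` and an accession of your choice. I believe `NM_000518` is the RefSeq accession for the human HBB mRNA, but treat that as an assumption and check it on NCBI first.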

🔗 GitHub Repository: ncbi_seq_analysis_hbb

📝 Today’s commit: e398c55


 

... I’ll work on it more in two days.

Every great journey begins with a single step!

Happy Bioinformatics! 🎉
