bookmarks for lifelong readers

Posted by chochang on Tue, Jul 22, 2025

References

Relates to:

  • [[Software Engineering]] (contains learning resource specifically for software engineering topics)
  • [[00 - Data Engineering]] (contains learning resources specifically for data engineering topics) –> online wiki: Data Engineering Wiki

Interesting sites:

  1. https://maximiliankiener.com/12/
  2. Brilliant Courses –> for beginning lifelong learner
  3. neal.fun –> funny little game
  4. tools for better thinking
  5. less wrong
  6. Interesting website for fun
  7. https://tinyclouds.org/
  8. the decision lab
  9. commoncog’s best post

Other’s blog/note:

Someone’s personal blogs that I can read and learn and borrow ideas from:

  1. Paul Graham: Essays
  2. Aaron Swartz’s:
  3. Sym-poly-masthesy –> the one excited with Rust 7 years ago, sharing about his career thoughts and miscs stuff
  4. Antirez (redis founder): https://antirez.com/latest/0
  5. https://brooker.co.za/blog/ (an aws developer I stumpled upon when finding distributed system papers to read –> he wrote about Guide of reading papers for SWE which I think kinda helpful –> [[2025-W07#Lifelong learning]])
  6. https://eatonphil.com/ –> focus on database, ditributed database system, good read. he also has a page listing his favorite developer blogs (https://eatonphil.com/blogs.html)
  7. https://luminousmen.com/

Other blogs: 6. https://jaehyeon.me/blog/ (DE blog) 7. https://braindump.jethro.dev/ (braindump, notes collection) 8. https://www.gaurgaurav.com/ (code thoughts - blogs share coding practices, programming tutorials) 9. https://dataengineering.wiki/Index –> data engineering wiki, sponsored by Data engineering jobs and great expectations 10. others’ threads (sharing cool readings): - @nqhieu2001

Research Papers

Github resources

Books

The one about discipline (kinda self-help): Put your ass where your heart wants to be

About books: https://huyenchip.com/2022/12/27/books-for-every-engineer.html

Thanks for sharing! - For 2023, consider perhaps: • Applied Minds: How Engineers Think, by Guru Madhavan • Out of our Minds: The Power of Being Creative, by Sir Ken Robinson • Peak: Secrets from the New Science of Expertise, by Anders Ericsson (a great counter-read for Range by David Epstein). Ericsson is the author from who’s work Gladwell (mis)appropriated the 10K Hr. rule. • Growing Wings on the Way: Systems Thinking for Messy Situations, by Rosalind Armson (This is the book I would have written on systems thinking if I could write a book!)

Other Books

Visualizing Google Cloud Architecting Google Cloud Solutions Building your next big thing with Google Cloud Platform Google Cloud Cookbook Data Science on the Google Cloud Platform Learning Google BigQuery

The Economic Benefits of Google Cloud Data Fusion https://services.google.com/fh/files/misc/esg-economic-validation-google-cloud-data-fusion.pdf

System Design Interview Streaming Systems: Large-scale data processing Grokking Streaming Systems: Real-time event processing

Data Related Stuff

gitlab handbook: Data Team Learning Library

Free data source and inspiration

Free data sources:

  1. statista.com –> empowering people with data (insights and facts across 170 industries and 150+ countries)
  2. ourworldindata.org –> research and data to make progress against the world’s largest problems
  3. list of open data sources: data sources for journalism and research; government; science and technology; international organizations;
  4. fivethirtyeight
  5. google: dataset search –> search dataset for research
  6. Free datasets for analytics projects: datastoryteller.gumroad.com
  7. github: public APIs for free: https://github.com/public-apis/public-apis
  8. https://power.larc.nasa.gov/

==From reddit thread:== List of open source data sources:

Reports/dashboard inspirations:

  1. Power BI DataViz World Championships
  2. pudding.cool

Data stacks newsletters

Resource collection: - github: data engineer handbook, all resources to learn about DE

https://devv.ai/ –> AI search tools specialized for Developer

Learning path:

Specialized Newsletters:

Data Companies’ resources:

Helpful posts on programming, cloud infrastructure, AI, software engineering topics. Piece of daily common encounters: 11. how.wtf

Data Product mindset

For data leader:

Data role should focus on impact:

Data Pipeline Design Patterns

Unit test for SQL script:

Other:

  • What are metrics of data platform???

Common patterns for data ingestion/data engineering??

Data Governance

Relates to:

  • [[Data Governance]]

Data Lineage

How to understand and use your data? (scalable solution)

Data Observability / other: Data Quality

relates to: [[Data Observability]] practices of data quality:

  • tagging data model according to their importance level. example: tier 1, critical, gold standard, etc.

read more about how to measure data quality (having metrics to track your key data metrics = data quality + productivity + engagement

Keywords:

  • active metadata

Tools

(python module: alembic ) Schema management (medium) faster and advanced your python code tips:

Style Guide for Data team

Can refer to:

Data contracts

Use conventional commits for data development (git):

Document Management / Document Writing

Relates to:

  • [[Write documentation effectively]]

Good guide (how to organize knowledge base for data and analytics team): medium post