Pre-launch · Module 1 begins soon

A 12-week semantic web curriculum, worked in public.

RDF, OWL, SPARQL, and modern knowledge graphs — from first principles to a deployed hybrid LLM + knowledge graph application. Every artifact, note, and stumbling point published openly.

Duration
12 weeks · 4 modules
Effort
~5–7 hours / week
Author
Barbara Hidalgo-Sotelo
Brand
Sensemaking AI
The premise

Why publish
learning openly?

Most learning artifacts describe finished states. This one shows the process. The redos, the failed exercises, and the wrong turns are part of the published value, not embarrassments to hide.

i.

Forcing function

Public commitments stall less. Knowing the work is visible turns “I should write notes” into “I have to ship notes.”

ii.

Compounding portfolio

Each module produces real artifacts: published ontologies, evaluation notebooks, deployed demos. The curriculum is the portfolio.

iii.

Honest signal

A real evaluation, including where graph augmentation hurts retrieval, is more useful than a polished case study that only shows the wins.

The arc

Four modules across twelve weeks.

The curriculum builds from foundations (reading and writing the basic data model) to deployment (shipping a system that combines vector and graph retrieval). Module 3 is genuinely the hardest segment — planning slower pacing there matters.

The 12-week curriculum arc Four modules arranged left to right: Foundations, Modeling, Reasoning, and Shipping. Each shows weeks and key concepts. i. Foundations Weeks 1–3 RDF · Turtle · SPARQL ii. Modeling Weeks 4–6 RDFS · OWL · design judgment iii. ★ Reasoning Weeks 7–9 Inference · reification · SHACL iv. Shipping Weeks 10–12 SPARQL UPDATE · LLM + KG From first principles → deployed system LIGHT MEDIUM HEAVY MIXED

The arc is reminiscent of marathon prep: base, build, peak, taper-and-race. Module 3 is the segment where most of the material is genuinely new ground for me, personally.

i.

Foundations

RDF, Turtle, basic SPARQL. The conceptual core: knowledge as triples with global identifiers.

Weeks 1–3
ii.

Modeling

OWL ontology design. The shift from RDF as data format to ontology as design discipline.

Weeks 4–6
iii.

Reasoning

Inference, reification, vocabulary alignment, SHACL validation. The conceptually hardest segment.

Weeks 7–9
iv.

Shipping

SPARQL UPDATE, deployment, hybrid LLM + knowledge graph capstone.

Weeks 10–12
What I’ll learn

The semantic web stack, layered.

Each layer builds on the one below it. The curriculum walks up the stack in order: triples first, schemas next, ontologies and reasoning after that. SPARQL is the query language that runs across all layers.

The semantic web stack Layered diagram showing RDF at the bottom, then RDFS, OWL, SHACL, and Applications at the top, with SPARQL running vertically through all layers. RDF triples · URIs · the graph RDFS classes · subclasses · domain/range OWL restrictions · equivalence · reasoning SHACL constraints · validation · shapes Applications hybrid retrieval · KG explorer SPARQL — queries through all layers Built bottom up
The anchoring projects

Two existing projects as canvases.

The curriculum doesn’t use toy examples. It uses two real projects that already have substantial work behind them. Each anchors specific parts of the curriculum where the technology shines for it.

Primary canvas

Naruto Network Graph

A character network from anime subtitle data — 87 characters across three story arcs, with hand-coded canonical relationships layered over co-appearance edges. The rich categorical structure of the Naruto universe (ninja ranks, villages, jutsu, contested fan canon) makes it an unusually good semantic-web testbed. Pop-culture domains are standard in ontology pedagogy: the Pizza ontology is the canonical Protégé tutorial for the same reason.

87 characters 3 arcs 36 canonical relationships
Module 1 anchor

Resume Graph Explorer

A career graph with ESCO/SKOS integration — existing Neo4j implementation with skill taxonomy linked to the European Skills, Competences, Qualifications and Occupations vocabulary. The reason it anchors Module 1: ESCO is already a SKOS vocabulary, which means the case for RDF over a labeled property graph is exceptionally sharp here. Returns in Module 3 as a venue for skill inference work.

Neo4j + Cypher SKOS + ESCO Module 1 + 3
Project ecosystem The curriculum repo at the center, with three projects orbiting: Naruto Network Graph as primary canvas, Resume Graph Explorer as Module 1 anchor, and TwinKit as the capstone framework. Sensemaking Semantic Web Naruto Network Graph Primary canvas Modules 2, 3, 4 Resume Graph Explorer ESCO/SKOS anchor Modules 1, 3 TwinKit Capstone framework Module 4

Each project enters where it adds the most learning value. Resume Graph for the LPG-vs-RDF case where ESCO makes RDF shine. Naruto for the rich modeling and reasoning work. TwinKit as the capstone framework where everything composes.

The repository

Everything lives here.

The GitHub repository is the canonical record. Syllabus, weekly progress, exercises, ontologies, notes, and published artifacts all live in one place. Visitors land on the README; followers track PROGRESS.md for weekly updates.

// dagny099/semantic-web-curriculum ├── README.md // repo front door ├── SYLLABUS.md // full 12-week curriculum ├── PROGRESS.md // live weekly tracker ← updated 2x/week ├── REFLECTIONS.md // cross-module learnings ├── modules/ │ ├── 01-foundations/ │ ├── 02-modeling/ │ ├── 03-reasoning/ │ └── 04-shipping/ └── resources/ ├── reading-list.md ├── tools.md └── public-kgs.md
Publishing in public

Four platforms, distinct roles.

The work lives across multiple surfaces. Each has a different role and a different voice. GitHub is the canonical record. LinkedIn broadcasts progress. The two blogs handle longer-form thinking that doesn’t belong in either.

When I get started

This weekend: pre-week setup.

The pre-week is roughly 2–3 hours of setup work spread across morning and afternoon. Front-loading these means Module 1’s first day isn’t lost to environment-fighting.

Morning ~1.5 hours

  • Push the initial repo files to GitHub. README, SYLLABUS, PROGRESS, CLAUDE.md, Module 1 README. The repo should look maintained from day one.
  • Install Apache Jena + Fuseki locally. Verify it runs on localhost:3030 and I can create a new dataset.
  • Install Protégé and open it once. Don’t do anything with it yet.
  • Bookmark the W3C primer set. RDF 1.1, Turtle, SPARQL 1.1 Query — these are my spec references.

Afternoon ~1 hour

  • Pick the LinkedIn launch post variant and schedule for Tuesday or Wednesday mid-morning Central time. Repo link goes in the first comment, not the post body.
  • Block 12 Tuesday lunch slots as “synthesis writing” (30 minutes each). That single ritual carries the curriculum forward.
  • Order or borrow Allemang, Hendler & Gandon (3rd ed., 2020) if not already on the shelf.
  • Skim Hogan et al. introduction (kgbook.org sections 1–2). Vocabulary calibration only — don’t try to read the whole thing.

This week (Module 1 begins) ~6 hours

  • Read Allemang chapters 1–3 (RDF foundations, RDFS).
  • Exercise 1.1: Wikidata orientation. Three queries through query.wikidata.org. Notes go in modules/01-foundations/notes/week-01.md.
  • Exercise 1.2: Hand-write a FOAF graph of myself and three of my projects.
  • Begin Exercise 1.3: Resume Graph RDF slice. The Module 1 primary project.
  • Write Week 1 synthesis notes. One paragraph, in my own words, on what it felt like to cover the material.
  • Post first Monday LinkedIn update. “Week 1 of 12” momentum check.
Coming as the work progresses

Placeholders for richer content.

These slots will fill in as each module produces its artifacts. Listed here so the structure is visible from day one and the work has clear targets.

Interactive · Module 1

SPARQL playground for ESCO queries

Embedded query editor with sample queries from the Resume Graph Explorer comparison. Run live against a public endpoint.

Visualization · Module 2

Naruto ontology in WebVOWL

Interactive class hierarchy explorer rendered from the published Turtle. Click any node to see properties and instances.

Video · End of Module 2

Designing an ontology in 3 hours

Time-lapse screen recording of the actual modeling work. The wrong turns visible. Less polish than tutorial videos; more honesty.

Interactive · Module 3

Reification approaches, compared

Side-by-side: same fact expressed in classical RDF reification, n-ary, named graphs, RDF-star. Toggle between styles.

Deployed demo · Module 4

Naruto Knowledge Graph Explorer

Public web app. Ask natural-language questions, see generated SPARQL, explore character networks with ontology overlay.

Dashboard · ongoing

Live progress dashboard

Auto-generated from PROGRESS.md commit history. Modules completed, artifacts shipped, current focus. Embedded on homepage.

Required reading

The canonical references.

None of the conceptual content in the curriculum is original; the contribution is the path through it. These are the books and specs the curriculum points to repeatedly. The full bookmark set lives in resources/reading-list.md.

01
Semantic Web for the Working Ontologist
Allemang, Hendler & Gandon · 3rd edition · ACM Books 2020 · workingontologist.org
Book
02
Learning SPARQL
Bob DuCharme · 2nd edition · O’Reilly 2013 · learningsparql.com
Book
03
Knowledge Graphs
Hogan, Blomqvist, Cochez, et al. · Synthesis Lectures · 2021 · kgbook.org
Open Access
04
Linked Data: Evolving the Web into a Global Data Space
Heath & Bizer · linkeddatabook.com
Open Access
05
W3C Specifications
RDF 1.1 Primer · Turtle · SPARQL 1.1 · OWL 2 Primer · SHACL · SKOS · w3.org/TR
Primary source
About the project

One practitioner, working openly.

This curriculum is published under Sensemaking AI, an independent AI and machine learning consulting practice. It’s run solo by Barbara Hidalgo-Sotelo — cognitive scientist, AI/ML consultant, Toastmaster, marathon runner, beekeeper, and longtime builder of quirky-but-rigorous data projects.

Two web homes, two different jobs. barbhs.com is the technical portfolio — projects, experiments, the body of work. sensemaking-ai.com is the practice — applied AI work for people and organizations who want to think clearly with AI, not just move faster because of it.

The curriculum sits between them. It’s how the technical body of work gets developed in the open, and where the practice’s intellectual commitments get tested against real material.

Questions, corrections, and discussion welcome via the repository’s issues. Pull requests on the curriculum itself aren’t accepted — this is a personal learning project — but disagreements about design choices are always interesting.

Exploring messy data, intelligent systems, and what it means to make meaning — through the lens of a cognitive scientist who builds things people actually use.