Recipe parser

Norish imports structured recipes through a parser service. By default, non-video URL imports go through the Python parser API first, scraping the page with a headless Chrome instance.

How it works

Structured imports use the recipe-scrapers library (wrapped by apps/parser-api). Page scraping uses headless Chrome at CHROME_WS_ENDPOINT — the chrome-headless service in the Quick start compose.

Settings

Variable	Description	Default
`CHROME_WS_ENDPOINT`	Playwright CDP WebSocket endpoint for scraping	`ws://chrome-headless:3000`
`PARSER_API_TIMEOUT_MS`	Parser API timeout in milliseconds	`15000`
`LEGACY_RECIPE_PARSER_ROLLBACK`	Re-enable the deprecated legacy parser	`false`

Content detection

Advanced overrides for how recipe content is detected. Most instances never need these.

Variable	Description	Default
`UNITS_JSON`	Override the units dictionary	(empty)
`CONTENT_INDICATORS`	Override recipe-content indicator configuration	(empty)
`CONTENT_INGREDIENTS`	Override ingredient-content configuration	(empty)

Rollback to the legacy parser

If the parser API is unhealthy, you can temporarily switch structured imports back to the deprecated JSON-LD and microdata parser:

Set LEGACY_RECIPE_PARSER_ROLLBACK=true and restart Norish. Structured imports now use the legacy parser.
Once the parser API is healthy again, set LEGACY_RECIPE_PARSER_ROLLBACK=false and restart.

How it works​

Settings​

Content detection​

Rollback to the legacy parser​

How it works

Settings

Content detection

Rollback to the legacy parser