Implement hybrid approach for AI news

- Update ainews script to detect OpenAI URLs and mark as NEEDS_WEB_FETCH
- Update TOOLS.md with content availability table and hybrid workflow
- Update all 4 AI news cron jobs (10:05, 14:05, 18:05, 22:05) with hybrid instructions
  - Simon/Raschka: use ainews articles (fivefilters works)
  - OpenAI: use web_fetch tool (JS-heavy site)
Agent 2026-02-03 22:28:31 +00:00
parent e6248879b3
commit c7e2d429c0
5 changed files with 228 additions and 23 deletions

@@ -103,7 +103,7 @@ curl -s -X REPORT -u "$NEXTCLOUD_USER:$NEXTCLOUD_PASS" \
"$NEXTCLOUD_URL/remote.php/dav/calendars/$NEXTCLOUD_USER/$CALDAV_CALENDAR/"
```
-## AI News RSS
+## AI News RSS (Hybrid Approach)
Helper script: `~/bin/ainews`
@@ -120,10 +120,18 @@ ainews reset # Clear seen history
- Auto-tracks seen articles in `memory/ainews-seen.txt`
- Auto-prunes to 200 entries
-**Workflow for AI news briefing:**
-1. `ainews items` → shows NEW articles, marks them as seen
-2. Pick interesting ones, optionally fetch full content with `articles`
-3. Next briefing: only shows articles published since last check
+**Content availability by source:**
+| Source | Full Content | Method |
+|--------|-------------|--------|
+| Simon Willison | ✅ In RSS/fivefilters | `ainews articles` |
+| Sebastian Raschka | ✅ In RSS/fivefilters | `ainews articles` |
+| OpenAI Blog | ❌ JS-rendered | Use `web_fetch` tool |
+**Hybrid workflow for AI news briefing:**
+1. `ainews items` → shows NEW articles from all sources
+2. For Simon/Raschka: `ainews articles <urls>` to get full content
+3. For OpenAI: Use `web_fetch` tool directly (fivefilters can't extract JS sites)
+4. Write briefing with all content
---
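The routing step in the hybrid workflow could be sketched roughly as below. This is an illustration only, not the actual `ainews` code: `classify_url` and `route_urls` are hypothetical helper names, the `AINEWS_ARTICLES` label is invented for the example (only `NEEDS_WEB_FETCH` appears in the commit message), and matching on the `openai.com` host is an assumption about how OpenAI URLs are detected.

```bash
#!/bin/sh
# Hypothetical sketch: decide how each article URL should be fetched.
# The real ainews script may detect OpenAI URLs differently.

# Print NEEDS_WEB_FETCH for OpenAI URLs (JS-rendered, assumed host match),
# and AINEWS_ARTICLES (an invented label) for everything else.
classify_url() {
  case "$1" in
    http://openai.com/*|https://openai.com/*|http://*.openai.com/*|https://*.openai.com/*)
      echo "NEEDS_WEB_FETCH" ;;
    *)
      echo "AINEWS_ARTICLES" ;;
  esac
}

# Read URLs from stdin, one per line, and print "<method> <url>" for each.
route_urls() {
  while IFS= read -r url; do
    [ -n "$url" ] || continue          # skip blank lines
    printf '%s %s\n' "$(classify_url "$url")" "$url"
  done
}
```

Under these assumptions, piping a list of new article URLs into `route_urls` would show which ones can go to `ainews articles <urls>` and which ones need the `web_fetch` tool.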