paper_server/apps/resm
caoqianming 93ec54a98f refactor(resm): ScienceDirect 网页下载移出 resm 为独立脚本
按需解耦: 网页(pdfft)下载这类实验性、需人工过 Cloudflare 的功能不进 resm 流水线。

- 新增 scripts/sd_download.py: 独立可运行, DOI->PII->连接手动启动的 Chrome(CDP)
  过验证下载->校验真全文多页 PDF; 与 Django/resm 解耦, 仅读 config 取凭证
- 删除管理命令 try_sciencedirect_pdf, 移除 tasks.py 中 sciencedirect_web 相关函数
- .gitignore: scripts/* 保持忽略, 但放行 scripts/sd_download.py

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
2026-07-01 16:12:51 +08:00
..
management refactor(resm): ScienceDirect 网页下载移出 resm 为独立脚本 2026-07-01 16:12:51 +08:00
migrations feat(resm): 对接材料前沿简报补充期刊/关键词监控 2026-06-29 13:23:55 +08:00
__init__.py feat: 添加resm app 2026-01-23 10:37:41 +08:00
admin.py feat(resm): 期刊/关键词监控 PaperMonitor + 移除每日增量周期任务 2026-06-21 23:43:58 +08:00
apps.py feat: 添加resm app 2026-01-23 10:37:41 +08:00
cloudflare_checkbox2.png feat: 添加pyautogui调用 2026-02-09 15:17:02 +08:00
d_oaurl.py feat:通过cloudflare 验证 2026-03-23 16:30:18 +08:00
d_scihub.py feat:通过cloudflare 验证 2026-03-23 16:30:18 +08:00
filters.py feat(resm): paper 查询加 publication_date 精确 + 范围过滤 2026-06-22 11:12:00 +08:00
models.py feat(resm): 期刊/关键词监控 PaperMonitor + 移除每日增量周期任务 2026-06-21 23:43:58 +08:00
pdf_utils.py perf(resm): fix_preview_pdf 多进程并发扫描 2026-06-29 13:24:45 +08:00
serializers.py feat: paper list 加 pdf_url / xml_url 直链字段 + pg_trgm GIN 索引 2026-05-21 13:48:52 +08:00
services.py feat: 增加download_pdf 2026-01-28 15:01:49 +08:00
tasks.py perf(resm): fix_preview_pdf 多进程并发扫描 2026-06-29 13:24:45 +08:00
tests.py feat: 添加resm app 2026-01-23 10:37:41 +08:00
urls.py feat: 修改pdf 验证cloudflare 2026-03-24 10:34:06 +08:00
views.py feat: paper list 返 abstract + 加 retrieve 端点 + filterset 扩 year range / 多字段 2026-05-21 13:17:46 +08:00