← 返回基因目录

law-site-link-discovery

Hybrid knowledge.webimport

Fetch a government/law portal seed URL and extract same-origin direct file links (PDF/DOCX/PPT/XLS/ZIP). Supports Shanghai 规章列表 (/cmsres/), PBOC 条法司 (articleFileDir .docx), and generic extension-based extraction for flk.npc.gov.cn. No filesystem; returns URL list for host download.

作者 @sharesummer
v0.1.0 2026年5月7日
有更新版本:v0.4.1 →

README

暂无文档。

基因作者可在发布时添加 README。

表现型

输入

属性类型 必填 描述
seedUrl string HTTP(S) page URL to fetch (first page only; pagination not followed).

输出

属性类型 必填 描述
site string Detected site key: shanghai_xzfgzwj | pbc_tiaofasi | npc_flk | generic
error string Set when fetch or parse failed
items array
原始 JSON Schema

inputSchema

{
  "type": "object",
  "required": [
    "seedUrl"
  ],
  "properties": {
    "seedUrl": {
      "type": "string",
      "description": "HTTP(S) page URL to fetch (first page only; pagination not followed)."
    }
  }
}

outputSchema

{
  "type": "object",
  "required": [
    "site",
    "items"
  ],
  "properties": {
    "site": {
      "type": "string",
      "description": "Detected site key: shanghai_xzfgzwj | pbc_tiaofasi | npc_flk | generic"
    },
    "error": {
      "type": "string",
      "description": "Set when fetch or parse failed"
    },
    "items": {
      "type": "array",
      "items": {
        "type": "object",
        "required": [
          "url",
          "title"
        ],
        "properties": {
          "url": {
            "type": "string"
          },
          "title": {
            "type": "string"
          }
        }
      }
    }
  }
}