Bixin API 数据结构分析

基本信息

数据源类型: JSON API
API URL格式: https://www.bixbiy.com/api/discussions?filter[q]={关键词}&page[limit]=3&include=mostRelevantPost
请求方法: GET
Content-Type: application/json
Referer: https://www.bixbiy.com/
特殊说明: 该网站只提供移动云盘(mobile)链接，域名固定为caiyun.139.com，需要从HTML内容中解析网盘链接和密码

API响应结构

顶层结构

json

{
    "links": {
        "first": "https://www.bixbiy.com/api/discussions?filter%5Bq%5D=%E5%87%A1%E4%BA%BA%E4%BF%AE%E4%BB%99%E4%BC%A0+&page%5Blimit%5D=3&include=mostRelevantPost",
        "next": "https://www.bixbiy.com/api/discussions?filter%5Bq%5D=%E5%87%A1%E4%BA%BA%E4%BF%AE%E4%BB%99%E4%BC%A0+&page%5Blimit%5D=3&page%5Boffset%5D=3&include=mostRelevantPost"
    },
    "data": [
        // 讨论帖子数组
    ],
    "included": [
        // 相关回复内容数组
    ]
}

`data`数组中的讨论帖子结构

json

{
    "type": "discussions",
    "id": "5754",
    "attributes": {
        "title": "凡人修仙传（2025）更新至第8集",
        "slug": "5754",
        "commentCount": 1,
        "participantCount": 1,
        "createdAt": "2025-07-29T15:31:19+00:00",
        "lastPostedAt": "2025-07-29T15:31:19+00:00",
        "lastPostNumber": 1,
        "canReply": false,
        "canRename": false,
        "canDelete": false,
        "canHide": false,
        "isApproved": true,
        "canTag": false,
        "isSticky": false,
        "canSticky": false,
        "isStickiest": false,
        "isTagSticky": false,
        "canStickiest": false,
        "canTagSticky": false,
        "subscription": null,
        "isLocked": false,
        "canLock": false
    },
    "relationships": {
        "mostRelevantPost": {
            "data": {
                "type": "posts",
                "id": "6187"
            }
        }
    }
}

`included`数组中的回复内容结构

json

{
    "type": "posts",
    "id": "6187",
    "attributes": {
        "number": 1,
        "createdAt": "2025-07-29T15:31:19+00:00",
        "contentType": "comment",
        "contentHtml": "<p>凡人修仙传（2025）更新至第8集：<a href=\"https://caiyun.139.com/w/i/2oRhbuZoZbFpi\" rel=\"ugc nofollow\">https://caiyun.139.com/w/i/2oRhbuZoZbFpi</a></p>",
        "renderFailed": false,
        "canEdit": false,
        "canDelete": false,
        "canHide": false,
        "mentionedByCount": 0,
        "canFlag": false,
        "isApproved": true,
        "canApprove": false,
        "canLike": false,
        "likesCount": 0
    }
}

插件所需字段映射

源字段	目标字段	说明
`data[].id`	`UniqueID`	格式: `bixin-{discussion_id}`
`data[].attributes.title`	`Title`	讨论标题
`data[].attributes.createdAt`	`Datetime`	创建时间
`included[].attributes.contentHtml`	`Content`	HTML内容，需要解析提取网盘链接
`""`	`Channel`	插件搜索结果Channel为空
`[]`	`Tags`	标签数组（从标题或内容中提取）
解析的网盘链接	`Links`	从HTML内容中提取的网盘链接

网盘链接解析

HTML内容特点

格式: 包含HTML标签的文本内容，需要清理HTML标签获取纯文本
链接: 以<a href="...">标签形式存在，但更多是纯文本格式
示例:
- HTML格式: <a href="https://caiyun.139.com/w/i/2oRhbuZoZbFpi" rel="ugc nofollow">https://caiyun.139.com/w/i/2oRhbuZoZbFpi</a>
- 纯文本格式: https://caiyun.139.com/w/i/2oRhbuZoZbFpi

支持的网盘类型（bixin专用）

网盘类型	域名特征	示例链接	密码关键词
移动云盘	`caiyun.139.com`	`https://caiyun.139.com/w/i/2oRhbuZoZbFpi`	访问码、密码

重要说明: bixin插件只支持移动云盘，所有链接都是caiyun.139.com域名，不需要处理其他网盘类型。

链接解析策略（bixin专用）

HTML清理: 移除HTML标签，保留纯文本内容
链接提取: 从纯文本中提取移动云盘链接（只处理caiyun.139.com）
密码匹配: 匹配"访问码"或"密码"关键词
位置关联: 密码通常出现在链接附近的行中

插件开发指导

请求示例

searchURL := fmt.Sprintf("https://www.bixbiy.com/api/discussions?filter[q]=%s&page[limit]=3&include=mostRelevantPost", url.QueryEscape(keyword))

请求头设置（参考pan666实现）

req.Header.Set("User-Agent", getRandomUA()) // 使用随机UA避免反爬虫
req.Header.Set("X-Forwarded-For", generateRandomIP()) // 随机IP
req.Header.Set("Accept", "application/json, text/plain, */*")
req.Header.Set("Accept-Language", "zh-CN,zh;q=0.9,en;q=0.8")
req.Header.Set("Connection", "keep-alive")
req.Header.Set("Sec-Fetch-Dest", "empty")
req.Header.Set("Sec-Fetch-Mode", "cors")
req.Header.Set("Sec-Fetch-Site", "same-origin")

SearchResult构建示例

result := model.SearchResult{
    UniqueID: fmt.Sprintf("bixin-%s", discussion.ID),
    Title:    discussion.Attributes.Title,
    Content:  extractTextFromHTML(post.Attributes.ContentHTML),
    Links:    extractLinksFromHTML(post.Attributes.ContentHTML),
    Tags:     extractTagsFromTitle(discussion.Attributes.Title),
    Channel:  "", // 插件搜索结果Channel为空
    Datetime: parseTime(discussion.Attributes.CreatedAt),
}

HTML内容解析函数（参考pan666实现）

// 清理HTML内容（参考pan666的cleanHTML函数）
func (p *BixinAsyncPlugin) cleanHTML(html string) string {
    // 移除
标签
    html = strings.ReplaceAll(html, "
", "\n")
    html = strings.ReplaceAll(html, "
", "\n")
    html = strings.ReplaceAll(html, "
", "\n")
    
    // 移除其他HTML标签
    var result strings.Builder
    inTag := false
    
    for _, r := range html {
        if r == '<' {
            inTag = true
            continue
        }
        if r == '>' {
            inTag = false
            continue
        }
        if !inTag {
            result.WriteRune(r)
        }
    }
    
    // 处理HTML实体
    output := result.String()
    output = strings.ReplaceAll(output, "&amp;", "&")
    output = strings.ReplaceAll(output, "&lt;", "<")
    output = strings.ReplaceAll(output, "&gt;", ">")
    output = strings.ReplaceAll(output, "&quot;", "\"")
    output = strings.ReplaceAll(output, "&apos;", "'")
    output = strings.ReplaceAll(output, "&#39;", "'")
    output = strings.ReplaceAll(output, "&nbsp;", " ")
    
    // 处理多行空白
    lines := strings.Split(output, "\n")
    var cleanedLines []string
    
    for _, line := range lines {
        trimmed := strings.TrimSpace(line)
        if trimmed != "" {
            cleanedLines = append(cleanedLines, trimmed)
        }
    }
    
    return strings.Join(cleanedLines, "\n")
}

// 从文本中提取链接（参考pan666的extractLinksFromText函数）
func (p *BixinAsyncPlugin) extractLinksFromText(content string) []model.Link {
    var allLinks []model.Link
    
    lines := strings.Split(content, "\n")
    
    // 收集所有可能的链接信息
    var linkInfos []struct {
        link     model.Link
        position int
        category string
    }
    
    // 收集所有可能的密码信息
    var passwordInfos []struct {
        keyword   string
        position  int
        password  string
    }
    
    // 第一遍：查找所有的链接和密码
    for i, line := range lines {
        line = strings.TrimSpace(line)
        
        // 只检查移动云盘（bixin只支持移动云盘）
        if strings.Contains(line, "caiyun.139.com") {
            url := p.extractURLFromText(line)
            if url != "" {
                linkInfos = append(linkInfos, struct {
                    link     model.Link
                    position int
                    category string
                }{
                    link:     model.Link{URL: url, Type: "mobile"},
                    position: i,
                    category: "mobile",
                })
            }
        }
        
        // 检查密码/访问码（移动云盘主要使用访问码）
        passwordKeywords := []string{"访问码", "密码"}
        for _, keyword := range passwordKeywords {
            if strings.Contains(line, keyword) {
                // 寻找冒号后面的内容
                colonPos := strings.Index(line, ":")
                if colonPos == -1 {
                    colonPos = strings.Index(line, "：")
                }
                
                if colonPos != -1 && colonPos+1 < len(line) {
                    password := strings.TrimSpace(line[colonPos+1:])
                    // 如果密码长度超过10个字符，可能不是密码
                    if len(password) <= 10 {
                        passwordInfos = append(passwordInfos, struct {
                            keyword   string
                            position  int
                            password  string
                        }{
                            keyword:   keyword,
                            position:  i,
                            password:  password,
                        })
                    }
                }
            }
        }
    }
    
    // 第二遍：将密码与链接匹配
    for i := range linkInfos {
        // 检查链接自身是否包含密码
        password := p.extractPasswordFromURL(linkInfos[i].link.URL)
        if password != "" {
            linkInfos[i].link.Password = password
            continue
        }
        
        // 查找最近的密码
        minDistance := 1000000
        var closestPassword string
        
        for _, pwInfo := range passwordInfos {
            // 移动云盘匹配访问码或密码
            match := false
            
            if linkInfos[i].category == "mobile" && (pwInfo.keyword == "访问码" || pwInfo.keyword == "密码") {
                match = true
            }
            
            if match {
                distance := abs(pwInfo.position - linkInfos[i].position)
                if distance < minDistance {
                    minDistance = distance
                    closestPassword = pwInfo.password
                }
            }
        }
        
        // 只有当距离较近时才认为是匹配的密码
        if minDistance <= 3 {
            linkInfos[i].link.Password = closestPassword
        }
    }
    
    // 收集所有有效链接
    for _, info := range linkInfos {
        allLinks = append(allLinks, info.link)
    }
    
    return allLinks
}

辅助函数（参考pan666实现）

// 从文本中提取URL
func (p *BixinAsyncPlugin) extractURLFromText(text string) string {
    // 查找URL的起始位置
    urlPrefixes := []string{"http://", "https://"}
    start := -1
    
    for _, prefix := range urlPrefixes {
        pos := strings.Index(text, prefix)
        if pos != -1 {
            start = pos
            break
        }
    }
    
    if start == -1 {
        return ""
    }
    
    // 查找URL的结束位置
    end := len(text)
    endChars := []string{" ", "\t", "\n", "\"", "'", "<", ">", ")", "]", "}", ",", ";"}
    
    for _, char := range endChars {
        pos := strings.Index(text[start:], char)
        if pos != -1 && start+pos < end {
            end = start + pos
        }
    }
    
    return text[start:end]
}

// 从URL中提取密码
func (p *BixinAsyncPlugin) extractPasswordFromURL(url string) string {
    // 查找密码参数
    pwdParams := []string{"pwd=", "password=", "passcode=", "code="}
    
    for _, param := range pwdParams {
        pos := strings.Index(url, param)
        if pos != -1 {
            start := pos + len(param)
            end := len(url)
            
            // 查找参数结束位置
            for i := start; i < len(url); i++ {
                if url[i] == '&' || url[i] == '#' {
                    end = i
                    break
                }
            }
            
            if start < end {
                return url[start:end]
            }
        }
    }
    
    return ""
}

// 绝对值函数
func abs(n int) int {
    if n < 0 {
        return -n
    }
    return n
}

// 生成随机UA
func getRandomUA() string {
    userAgents := []string{
        "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/91.0.4472.124 Safari/537.36",
        "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/92.0.4515.107 Safari/537.36",
        "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/605.1.15 (KHTML, like Gecko) Version/14.1.2 Safari/605.1.15",
        "Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:90.0) Gecko/20100101 Firefox/90.0",
        "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/91.0.4472.114 Safari/537.36",
        "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/92.0.4515.107 Safari/537.36",
    }
    return userAgents[rand.Intn(len(userAgents))]
}

// 生成随机IP
func generateRandomIP() string {
    return fmt.Sprintf("%d.%d.%d.%d", 
        rand.Intn(223)+1,  // 避免0和255
        rand.Intn(255),
        rand.Intn(255),
        rand.Intn(254)+1)  // 避免0
}

时间解析函数

func (p *BixinAsyncPlugin) parseTime(timeStr string) time.Time {
    // 解析ISO 8601格式时间
    t, err := time.Parse("2006-01-02T15:04:05Z07:00", timeStr)
    if err != nil {
        return time.Now()
    }
    return t
}

数据结构定义

API响应结构体

type BixinAPIResponse struct {
    Links    BixinLinks `json:"links"`
    Data     []BixinDiscussion `json:"data"`
    Included []BixinPost `json:"included"`
}

type BixinLinks struct {
    First string `json:"first"`
    Next  string `json:"next"`
}

type BixinDiscussion struct {
    Type         string `json:"type"`
    ID           string `json:"id"`
    Attributes   BixinDiscussionAttributes `json:"attributes"`
    Relationships BixinRelationships `json:"relationships"`
}

type BixinDiscussionAttributes struct {
    Title           string    `json:"title"`
    Slug            string    `json:"slug"`
    CommentCount    int       `json:"commentCount"`
    ParticipantCount int      `json:"participantCount"`
    CreatedAt       string    `json:"createdAt"`
    LastPostedAt    string    `json:"lastPostedAt"`
    LastPostNumber  int       `json:"lastPostNumber"`
    IsApproved      bool      `json:"isApproved"`
    IsLocked        bool      `json:"isLocked"`
}

type BixinRelationships struct {
    MostRelevantPost BixinPostRef `json:"mostRelevantPost"`
}

type BixinPostRef struct {
    Data BixinPostData `json:"data"`
}

type BixinPostData struct {
    Type string `json:"type"`
    ID   string `json:"id"`
}

type BixinPost struct {
    Type       string `json:"type"`
    ID         string `json:"id"`
    Attributes BixinPostAttributes `json:"attributes"`
}

type BixinPostAttributes struct {
    Number           int    `json:"number"`
    CreatedAt        string `json:"createdAt"`
    ContentType      string `json:"contentType"`
    ContentHTML      string `json:"contentHtml"`
    RenderFailed     bool   `json:"renderFailed"`
    IsApproved       bool   `json:"isApproved"`
    LikesCount       int    `json:"likesCount"`
}

特殊处理逻辑

1. 讨论与回复关联

通过relationships.mostRelevantPost.data.id关联讨论和回复
需要在included数组中查找对应的回复内容
一个讨论可能对应多个回复，需要处理所有相关回复

2. HTML内容清理

移除HTML标签获取纯文本内容
解码HTML实体（如<、>等）
提取链接时保留原始URL

3. 链接验证

验证链接是否为有效的网盘链接
过滤掉无效链接（如javascript:、#等）
提取链接中的密码信息

4. 标签提取

从讨论标题中提取关键词作为标签
可以基于内容类型、年份等信息生成标签
支持中文和英文标签

与pan666插件的相似性

特性	bixin	pan666	说明
数据源	论坛讨论API	论坛讨论API	使用相同的论坛系统
API结构	相同	相同	JSON结构完全一致
链接解析	文本解析	文本解析	都需要从HTML清理后的文本中提取
主要网盘	移动云盘	移动云盘	都主要提供移动云盘链接
密码匹配	位置关联	位置关联	使用相同的密码匹配策略
过滤策略	跳过Service层过滤	跳过Service层过滤	都使用`NewBaseAsyncPluginWithFilter`

与其他插件的差异

特性	bixin/pan666	其他插件	说明
数据源	论坛讨论API	网盘搜索API	需要解析HTML内容
链接格式	纯文本格式	直接URL字符串	需要从文本中提取
内容结构	讨论+回复	直接资源信息	需要关联处理
链接验证	必需	可选	论坛可能包含无效链接
过滤策略	跳过Service层过滤	启用Service层过滤	论坛内容需要宽泛搜索

注意事项

HTML解析: 需要正确处理HTML标签和实体，参考pan666的cleanHTML函数
链接提取: 主要从纯文本中提取链接，而非HTML标签
内容关联: 需要将讨论和回复内容正确关联
链接验证: 论坛内容可能包含无效链接，需要过滤
时间解析: 使用ISO 8601格式解析时间
错误处理: API可能返回空数据或格式错误
反爬虫: 使用随机UA和IP避免反爬虫检测
密码匹配: 使用位置关联策略匹配密码和链接

开发建议

优先级设置: 建议设置为优先级3，数据质量一般
Service层过滤: 跳过Service层过滤，使用NewBaseAsyncPluginWithFilter("bixin", 3, true)
HTML处理: 重点处理HTML内容的解析和清理，参考pan666实现
链接提取: 实现robust的链接提取和验证机制，只处理移动云盘（caiyun.139.com）
缓存策略: 建议使用较短的缓存TTL，论坛内容更新频繁
错误日志: 详细记录HTML解析和链接提取的错误信息
基于pan666: 可以直接基于pan666插件进行修改，主要更改API URL和插件名称

API调用示例

搜索请求示例

bash

curl "https://www.bixbiy.com/api/discussions?filter[q]=凡人修仙传&page[limit]=3&include=mostRelevantPost" \
  -H "Referer: https://www.bixbiy.com/" \
  -H "User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36"

完整流程示例

发送搜索请求: 获取讨论列表和回复内容
解析讨论数据: 提取标题、时间等基本信息
关联回复内容: 通过ID关联讨论和回复
清理HTML内容: 移除HTML标签，获取纯文本
提取网盘链接: 从纯文本中提取移动云盘链接（只处理caiyun.139.com）
匹配密码: 使用位置关联策略匹配密码和链接
验证链接有效性: 过滤无效链接
构建搜索结果: 转换为PanSou标准格式
返回结果: 包含标题、内容、链接等信息

插件实现建议

// 基于pan666插件进行修改
func NewBixinAsyncPlugin() *BixinAsyncPlugin {
    return &BixinAsyncPlugin{
        BaseAsyncPlugin: plugin.NewBaseAsyncPluginWithFilter("bixin", 3, true), // 跳过Service层过滤
        retries:         MaxRetries,
    }
}

// 主要修改点：
// 1. 更改API URL: "https://www.bixbiy.com/api/discussions"
// 2. 更改插件名称: "bixin"
// 3. 简化链接提取：只处理移动云盘（caiyun.139.com）
// 4. 简化密码匹配：只匹配"访问码"和"密码"关键词
// 5. 保持相同的HTML解析逻辑

Bixin API 数据结构分析

Bixin API 数据结构分析

基本信息

API响应结构

顶层结构

data数组中的讨论帖子结构

included数组中的回复内容结构

插件所需字段映射

网盘链接解析

HTML内容特点

支持的网盘类型（bixin专用）

链接解析策略（bixin专用）

插件开发指导

请求示例

请求头设置（参考pan666实现）

SearchResult构建示例

HTML内容解析函数（参考pan666实现）

辅助函数（参考pan666实现）

时间解析函数

数据结构定义

API响应结构体

特殊处理逻辑

1. 讨论与回复关联

2. HTML内容清理

3. 链接验证

4. 标签提取

与pan666插件的相似性

与其他插件的差异

注意事项

开发建议

API调用示例

搜索请求示例

完整流程示例

插件实现建议

`data`数组中的讨论帖子结构

`included`数组中的回复内容结构