Swiftask
  • Quick start
  • Key concepts
    • AI Tools Hub
    • Agents
    • Knowledge base
    • Skills
    • Projects
    • Automation
  • AI tools hub
    • Introduction
    • Chat interface
    • Tokens
    • List of AI features
    • AI suggestions
    • FAQ
  • Agents
    • Introduction
    • Create an agent step by step
    • How to evaluate your agent
    • Multi-agents
    • Widget
    • Share agent
    • FAQ
  • Knowledge base
    • Introduction
    • Data connectors
      • Rich text
      • PDF File
      • Azure Document Loader
      • YouTube
      • Apify Dataset
      • PowerPoint File
      • Excel File
      • DOCX File
      • SQL Database
      • REST API
      • JSON File
      • CSV File
      • SQL Database Query
      • Website
      • Webpage
      • Sitemap
      • Dropbox files
      • Google drive files
    • Create a knowledge base
    • Attach Knowledge base to your agent
    • Share knowledge base
    • FAQ
  • Skills
    • Introduction
    • Skills library
      • Webpage Content Parsing
      • GitLab File Creation
      • Browsing with Perplexity
      • Open API
      • Retriever data from external sources
      • GitHub pull request diff retriever
      • GitHub pull request comment
      • Export table to Excel
      • Export text to PDF
      • GitHub file content
      • GitHub pull request info
      • OpenDataSoft
      • Agent as Skill
      • Swiftask AI recommandation
      • LinkedIn Share
      • Prismic migration create
      • Github create file
    • Create a new skill
    • Attach skill to your agent
    • FAQ
  • Projects
    • Introduction
    • Create a project
    • Generate task
    • Task AI chat
    • Organize chat in project
    • Project agent
    • FAQ
  • Automation
    • Introduction
    • Create an automation
  • Workspace admin
    • Introduction
    • Invite collaborators to join your workspace
    • Referral
    • Subscription renewal and Credit explanation
    • Purchase credits
    • Share agent
    • Subscription Pro plan/Team plan & token distribution
    • Create a project
    • Cancel subscription /Manage payment method
    • Personnal data security
    • SSO For enterprise
  • Use cases & Tutorials
    • Chat with multi-AI
    • Chat with PDF file
    • Import data - Webpage
    • How to generate an image on Swiftask
    • Import data (Azure Document Loader) - PDF
    • How to generate videos on Swiftask
    • Transform your ideas into videos with LUMA AI
    • Upgrade subscription plan
    • How to create an agent? step by step
    • Create AI agents for your business
    • Integrate external API in your agent
    • Create a professional landing page in 5 minutes
    • How to automate your blog content creation with an AI agent
    • How to evaluate your AI agent
    • How to create a Community Manager agent
  • Developer
    • List of AI and agents accessible via API
    • Access AI and agent through API
    • OpenAI SDK
  • Support & Social network
  • Changelog
Powered by GitBook
On this page
  1. Knowledge base
  2. Data connectors

Website

Browses a complete website to extract content and metadata

PreviousSQL Database QueryNextWebpage

Last updated 3 months ago

Detailed Explanation

  1. Name:

    • This field allows you to assign a specific name to your website data source, helping you identify it within your project.

    • Example: You might name it "E-commerce Product Listings" if the data pertains to products listed on an e-commerce site.

  2. Website URL:

    • This field is for specifying the URL of the website you want to scrape or gather data from.

    • Example: If your target website is https://www.example.com, you would enter this URL in the field.

  3. URLs to Exclude:

    • This optional field allows you to specify any URLs that you want to exclude from the scraping process. You can list multiple URLs separated by commas.

    • Example: If you want to exclude URLs like https://www.example.com/about, you can enter:

      Copyhttps://www.example.com/about, https://www.example.com/contact
  4. Chunk Size:

    • This field specifies the number of tokens or characters in each chunk of data processed. The default value is set to 1024, but you can modify it based on your needs. Adjusting the chunk size can help in managing large amounts of data more effectively.

    • Example: If you are gathering a large volume of data, you might set the chunk size to 512 for easier processing.

  5. Cost Information:

    • This section provides details about the cost associated with importing data from the specified website.

    • Example: If it states "Cost per seconds: 220 tokens" and "Remaining seconds: 203620 Seconds," this indicates how many tokens will be charged for each second of data processed and the total seconds of data remaining.