Swiftask
  • Quick start
  • Key concepts
    • AI Tools Hub
    • Agents
    • Knowledge base
    • Skills
    • Projects
    • Automation
  • AI tools hub
    • Introduction
    • Chat interface
    • Tokens
    • List of AI features
    • AI suggestions
    • FAQ
  • Agents
    • Introduction
    • Create an agent step by step
    • How to evaluate your agent
    • Multi-agents
    • Widget
    • Share agent
    • FAQ
  • Knowledge base
    • Introduction
    • Data connectors
      • Rich text
      • PDF File
      • Azure Document Loader
      • YouTube
      • Apify Dataset
      • PowerPoint File
      • Excel File
      • DOCX File
      • SQL Database
      • REST API
      • JSON File
      • CSV File
      • SQL Database Query
      • Website
      • Webpage
      • Sitemap
      • Dropbox files
      • Google drive files
    • Create a knowledge base
    • Attach Knowledge base to your agent
    • Share knowledge base
    • FAQ
  • Skills
    • Introduction
    • Skills library
      • Webpage Content Parsing
      • GitLab File Creation
      • Browsing with Perplexity
      • Open API
      • Retriever data from external sources
      • GitHub pull request diff retriever
      • GitHub pull request comment
      • Export table to Excel
      • Export text to PDF
      • GitHub file content
      • GitHub pull request info
      • OpenDataSoft
      • Agent as Skill
      • Swiftask AI recommandation
      • LinkedIn Share
      • Prismic migration create
      • Github create file
    • Create a new skill
    • Attach skill to your agent
    • FAQ
  • Projects
    • Introduction
    • Create a project
    • Generate task
    • Task AI chat
    • Organize chat in project
    • Project agent
    • FAQ
  • Automation
    • Introduction
    • Create an automation
  • Workspace admin
    • Introduction
    • Invite collaborators to join your workspace
    • Referral
    • Subscription renewal and Credit explanation
    • Purchase credits
    • Share agent
    • Subscription Pro plan/Team plan & token distribution
    • Create a project
    • Cancel subscription /Manage payment method
    • Personnal data security
    • SSO For enterprise
  • Use cases & Tutorials
    • Chat with multi-AI
    • Chat with PDF file
    • Import data - Webpage
    • How to generate an image on Swiftask
    • Import data (Azure Document Loader) - PDF
    • How to generate videos on Swiftask
    • Transform your ideas into videos with LUMA AI
    • Upgrade subscription plan
    • How to create an agent? step by step
    • Create AI agents for your business
    • Integrate external API in your agent
    • Create a professional landing page in 5 minutes
    • How to automate your blog content creation with an AI agent
    • How to evaluate your AI agent
    • How to create a Community Manager agent
  • Developer
    • List of AI and agents accessible via API
    • Access AI and agent through API
    • OpenAI SDK
  • Support & Social network
  • Changelog
Powered by GitBook
On this page
  1. Knowledge base
  2. Data connectors

PDF File

Enables the import and extraction of text from PDF files.

PreviousRich textNextAzure Document Loader

Last updated 3 months ago

Detailed Explanation

  1. Name:

    • This field is for assigning a specific name to your PDF data source. It helps you identify the connector within your project.

    • Example: You could name it "Annual Financial Report" if the PDF file pertains to financial reporting for the year.

  2. PDF File(s):

    • In this section, you can upload one or more PDF files directly from your local system. The interface typically supports drag-and-drop functionality for convenience.

    • Example: If you have a PDF file called 2023_Annual_Report.pdf, you can drag and drop it into this area or click to browse and select it.

  3. PDF File URL(s) (optional):

    • This optional field allows you to enter the URL of one or more PDF files if they are hosted online. This is useful if you prefer to link to a PDF rather than uploading it.

    • Example: You might enter a link like https://example.com/2023_Annual_Report.pdf if the PDF is available on your company's website.

  4. Chunk Size:

    • This field determines the number of tokens or characters in each chunk of data. The default value is 1024, but you can modify it according to your needs. Adjusting the chunk size can help in managing large PDFs more effectively, particularly during processing or analysis.

    • Example: If you are processing a very large document, you might choose to set the chunk size to 512 to ensure smoother handling.

  5. Cost Information:

    • The connector provides information regarding the cost associated with importing words from the specified PDF.

    • Example: If it states "Cost per words: 7 tokens" and "Remaining words: 6399701 Words," this indicates how many tokens will be charged for each word processed and the number of words remaining for processing.