Source Detection Activity
UMLActivity
Skill Seekers
::
skill_seekers
::
Source Detection Activity
Description
none
Diagrams
16 Source Auto-Detection
Nodes
start
SourceDetector.detect(source)
file extension?
Config (.json)
PDF / Word / EPUB / Jupyter / HTML / PPTX / AsciiDoc / Man / Video / RSS
video URL?
Video Scraper
is directory?
Codebase Scraper
GitHub pattern?
GitHub Scraper
http/https URL?
Web Doc Scraper
bare domain?
Web Scraper (https:// prepended)
Error: Cannot determine source type
merge
end
Edges
(start→SourceDetector.detect(source))
(SourceDetector.detect(source)→file extension?)
[.json] (file extension?→Config (.json))
[.pdf/.docx/.epub/.ipynb/.html/.pptx/etc] (file extension?→PDF / Word / EPUB / Jupyter / HTML / PPTX / AsciiDoc / Man / Video / RSS)
[no match] (file extension?→video URL?)
[youtube/video URL] (video URL?→Video Scraper)
[no] (video URL?→is directory?)
[yes: os.path.isdir] (is directory?→Codebase Scraper)
[no] (is directory?→GitHub pattern?)
[owner/repo or github.com URL] (GitHub pattern?→GitHub Scraper)
[no] (GitHub pattern?→http/https URL?)
[yes] (http/https URL?→Web Doc Scraper)
[no] (http/https URL?→bare domain?)
[has dot, not path] (bare domain?→Web Scraper (https:// prepended))
[no] (bare domain?→Error: Cannot determine source type)
(Config (.json)→merge)
(PDF / Word / EPUB / Jupyter / HTML / PPTX / AsciiDoc / Man / Video / RSS→merge)
(Video Scraper→merge)
(Codebase Scraper→merge)
(GitHub Scraper→merge)
(Web Doc Scraper→merge)
(Web Scraper (https:// prepended)→merge)
(Error: Cannot determine source type→merge)
(merge→end)
Properties
Name
Value
name
Source Detection Activity
stereotype
null
visibility
public
isReentrant
true
isReadOnly
false
isSingleExecution
false
Owned Elements
16 Source Auto-Detection