Trees In Cuenca Ecuador

Description

Important! I am not the owner nor do I claim copyright on the content from the PDF. All credit belongs to:

This purpose of this project is to demonstrate how text and images can be scraped from a PDF using NodeJS. It is then displayed with a very simple AngularJS application.

The images in the PDF are first dumped to an SVG format with a utility created by the team of PDF.js. The base64 content is then extracted and saved into the individual files. The naming convention is such that the photos can then later be associated with the appropriate tree.

Technologies and Methodologies Used

AngularJS
Spectre Front End Library
NodeJS
Regular Expressions
JSON / SVG

Building the data

node scripts/dump-pdf-to-svg.js app/resources/data/arboles.pdf
node scripts/extract-images-from-svg.js 
node scripts/extract-text-from-pdf.js
node scripts/associate-images-to-tree-group.js

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
app		app
scripts		scripts
.bowerrc		.bowerrc
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
bower.json		bower.json
index.html		index.html
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Trees In Cuenca Ecuador

Description

Technologies and Methodologies Used

Building the data

About

Releases

Packages

Languages

License

travispence/trees-in-cuenca-ecuador

Folders and files

Latest commit

History

Repository files navigation

Trees In Cuenca Ecuador

Description

Technologies and Methodologies Used

Building the data

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages