npm package discovery and stats viewer.

Discover Tips

  • General search

    [free text search, go nuts!]

  • Package details

    pkg:[package-name]

  • User packages

    @[username]

Sponsor

Optimize Toolset

I’ve always been into building performant and accessible sites, but lately I’ve been taking it extremely seriously. So much so that I’ve been building a tool to help me optimize and monitor the sites that I build to make sure that I’m making an attempt to offer the best experience to those who visit them. If you’re into performant, accessible and SEO friendly sites, you might like it too! You can check it out at Optimize Toolset.

About

Hi, 👋, I’m Ryan Hefner  and I built this site for me, and you! The goal of this site was to provide an easy way for me to check the stats on my npm packages, both for prioritizing issues and updates, and to give me a little kick in the pants to keep up on stuff.

As I was building it, I realized that I was actually using the tool to build the tool, and figured I might as well put this out there and hopefully others will find it to be a fast and useful way to search and browse npm packages as I have.

If you’re interested in other things I’m working on, follow me on Twitter or check out the open source projects I’ve been publishing on GitHub.

I am also working on a Twitter bot for this site to tweet the most popular, newest, random packages from npm. Please follow that account now and it will start sending out packages soon–ish.

Open Software & Tools

This site wouldn’t be possible without the immense generosity and tireless efforts from the people who make contributions to the world and share their work via open source initiatives. Thank you 🙏

© 2024 – Pkg Stats / Ryan Hefner

webquery

v1.0.9

Published

Query the web with SQL-like syntax

Downloads

19

Readme

WebQuery

Query the web with SQL-like syntax
Inspired by the great Yahoo! YQL, this tool can help you generate stub files for development, scrap data from multiple sources for your portal, perform website's health check, test your app or just for fun!

Installation

npm install -g webquery

Usage

Terminal (*NIX)

webquery [options...]
For example: To print to console the content, the lang attribute value and the number of children elements of all paragraph elements which has content class in page https://twitter.com/feditorio - just run the following command:
webquery -l -q "SELECT text, attr(lang), size(children) as total FROM https://twitter.com/feditorio WHERE jquery=(.content p)"

Options:

  • -q "QUERY" - Query statement
  • -f "JSON_OUTPUT_FILE_PATH" - JSON output file path
  • -ua "USER_AGENT" - Valid browser user agent
  • -l - Indicates whether results should be logged to console
  • -h - Prints usage information
  • -v - Prints the version number

Node App

var wq = require('webquery');

// Arguments:  
// 1 - {string} Query statement  
// 2 - {string} JSON output file path  
// 3 - {string} Valid browser user agent
// 4 - {boolean} Indicates whether results should be logged to console  
// Returns a promise
wq.query('SELECT text, attr(lang), size(children) as total FROM https://twitter.com/feditorio WHERE jquery=(.content p)', null, true).then(
  function success (result) {
    console.log('Query completed successfully!');
    console.log(result);
  },
  function error (err) {
    console.error('Query failed to complete: %s', err);
  }
);

Query Statement

SELECT {PROPERTY1}[, {PROPERTY2}[,...]] FROM {URL1}[, {URL2}[,...]] WHERE {SELECTOR1} [OR {SELECTOR2} [OR...]]

Property

You can use single or multiple comma-separated properties from the list below:

  • tag - Tag name
  • type - Element type
  • html - HTML contents
  • text - Combined text contents, including their descendants
  • value - Current value (form element)
  • id - Id attribute value
  • name - Name attribute value
  • class - CSS class names
  • index - Position of the element, relative to its sibling elements
  • attr(_attribute_) - Value of attribute attribute
  • data(_attribute_) - Value of data attribute attribute (without the data- prefix)
  • size(children) - Number of children
  • size(attributes) - Number of attributes

URL

Any valid URL which starts with http:// or https:// protocols.
You can query single or multiple comma-separated urls.

SELECTOR

You can use either jquery or xpath valid selectors. You may also mix them both or use multiple selectors of each type you like, separated with OR operator:

  • jQuery: WHERE jquery=(YOUR_SELECTOR_GOES_HERE)
    For example: WHERE jquery=(p > div.content)

  • XPath: WHERE xpath=(YOUR_SELECTOR_GOES_HERE)
    For example: WHERE xpath=(/*[@id=’foo’])

  • Mixed: 'WHERE jquery=(p > div.content) OR WHERE jquery=(#messages li) OR xpath=(/*[@id=’foo’])'

Output

{
  "meta": {
    "date": 1439761398928,          // UNIX time in which query was executed
    "duration": 2881,               // Time in milliseconds it took the query to complete
    "url": [                        // Array of URLs which were used in the "FROM" clause
      "https://my.website.com"
    ],
    "title": [                      // An array of pages titles of the url(s) above
      "My Website"
    ],
    "items": 36                     // Number of items found
  },
  "data": [                         // Array of all items which were found
    {
      // ..
    },
    //...
  ]
}

Known Issues

You may experience problems while executing webquery if you had to use SUDO to install it globally.
In general, it is most recommended to use NPM without having to run commands as administrator. To do so, follow the instructions below:

  1. Change prefix in NPM configuration:
    npm config set prefix ~/npm
  2. Add NPM's bin folder to your system's PATH in ~/.bashrc:
    PATH=$PATH:$HOME/npm/bin
  3. Reload ~/.bashrc:
    . ~./.bashrc

You may re-install now webquery package.