Advanced Web Searching 
November 18, 2014
Google alerts 
How Google Alerts work 
“You can get email notifications any time that Google finds new results on a topic you’re interested in. For example, you could get updates about a product you like, find out when people post content about you on the web, or keep up with news stories.”
filetype:docx, pdf, xls, ppt, pptx 
Typing filetype:xls in a search (no space after the colon) will show only spreadsheets. It’s incredibly useful for finding public data. 
e.g. filetype:ppt “the affordable care act” 
Google will return only PowerPoint files that mention the Affordable Care Act.
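The filetype: search above can also be assembled in a script, which helps when the same search is repeated for several file types. This is a minimal sketch, not an official Google interface: the helper names filetype_query and google_search_url are made up for illustration, and the result is just the ordinary search-page address with the query in its q parameter.

```python
from urllib.parse import urlencode


def filetype_query(phrase: str, filetype: str) -> str:
    """Compose a query such as: filetype:ppt "the affordable care act"."""
    # No space between the operator and the file extension.
    return f'filetype:{filetype} "{phrase}"'


def google_search_url(query: str) -> str:
    """Percent-encode the query into a URL that can be pasted into a browser."""
    return "https://www.google.com/search?" + urlencode({"q": query})


query = filetype_query("the affordable care act", "ppt")
print(query)
print(google_search_url(query))
```

Swapping "ppt" for "xls" or "pdf" gives the same search restricted to spreadsheets or PDFs.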
site: 
“affordable care act” statistics site:whitehouse.gov 
ebola statistics site:cdc.gov
Numerical ranges: the .. operator 
Search within numerical ranges using the .. operator. 
Olympics 1950..1960 
new york city mayor election results 1950..1960 -wikipedia
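The site:, .. and minus operators can be combined in one search. The sketch below only joins strings in the way the examples above are written; build_query and its parameters are hypothetical names chosen for illustration, and how Google ranks or interprets the combined query is up to Google.

```python
def build_query(terms, site=None, year_range=None, exclude=()):
    """Join plain search terms with optional operators into one query string."""
    parts = [terms]
    if year_range is not None:
        low, high = year_range
        # Two dots with no spaces ask for numbers between low and high.
        parts.append(f"{low}..{high}")
    if site:
        # No space after the colon, or the operator is not recognized.
        parts.append(f"site:{site}")
    # A leading minus sign drops results that contain the word.
    parts.extend(f"-{word}" for word in exclude)
    return " ".join(parts)


print(build_query("ebola statistics", site="cdc.gov"))
print(build_query("new york city mayor election results",
                  year_range=(1950, 1960), exclude=["wikipedia"]))
```

Each call reproduces one of the searches from the slides, so the function can be checked against them by eye.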
Reverse Google image search 
●“You can use a picture as your search to find related images from around the web. For example, if you search using a picture of your favorite band, you can find similar images, websites about the band, and even sites that include the same picture.” 
●“Search by image works best when the image is likely to show up in other places on the web. So you’ll get more results for famous landmarks than you will for personal images like your latest family photo.”
upload an image
archive.org 
The Wayback Machine
About the Internet Archive 
The Internet Archive is a 501(c)(3) non-profit that was founded to build an Internet library. Its purposes include offering permanent access for researchers, historians, scholars, people with disabilities, and the general public to historical collections that exist in digital format. 
Founded in 1996 and located in San Francisco, the Archive has been receiving data donations from Alexa Internet and others. In late 1999, the organization started to grow to include more well-rounded collections. Now the Internet Archive includes: texts, audio, moving images, and software as well as archived web pages in our collections, and provides specialized services for adaptive reading and information access for the blind and other persons with disabilities.
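Archived copies of a page can also be looked up from a script. The sketch below assumes the Internet Archive's public availability endpoint at archive.org/wayback/available, which takes url and timestamp parameters and answers with JSON; the function names are invented for illustration, and the response fields used here (archived_snapshots, closest, available, url) should be checked against the Archive's current documentation before relying on them.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumed endpoint of the Wayback Machine availability service.
AVAILABILITY_ENDPOINT = "https://archive.org/wayback/available"


def availability_url(page_url, timestamp=None):
    """Build the lookup address for a page, optionally near a date (YYYYMMDD)."""
    params = {"url": page_url}
    if timestamp:
        params["timestamp"] = timestamp
    return AVAILABILITY_ENDPOINT + "?" + urlencode(params)


def closest_snapshot(response):
    """Return the archived copy's address from a decoded reply, or None."""
    closest = response.get("archived_snapshots", {}).get("closest")
    if closest and closest.get("available"):
        return closest["url"]
    return None


def lookup(page_url, timestamp=None):
    """Ask the service over the network for the snapshot closest to the date."""
    with urlopen(availability_url(page_url, timestamp), timeout=10) as reply:
        return closest_snapshot(json.load(reply))
```

Calling lookup("cdc.gov", "20141118") would request the capture nearest that date; it needs a network connection and returns None when nothing has been archived.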
Who owns the building? 
Remember: NYC Map 
http://maps.nyc.gov/doitt/nycitymap/ 
ACRIS 
http://a836-acris.nyc.gov/CP/
What is the “deep web?” 
Anything behind a paywall, or that requires registration or a query search, is NOT indexed by Google.
Guidestar.org
501(c)(3) orgs: Form 990
OpenSecrets.org
Deleted tweets from politicians 
http://politwoops.sunlightfoundation.com/
Who is John Doe? 
General Backgrounding 
Who Is John Doe? 
Investigative Reporters & Editors checklist for finding biographical, legal, business, criminal, etc. info on a person
Bellingcat 
Bellingcat: a company that uses citizen journalists and social media for investigative reporting. They were able to show that the Russian government was lying about the missile launcher apparently used to shoot down Malaysia Airlines Flight 17.
https://wikileaks.org/
governmentattic.org
Look for professional licenses. Find “Jennifer,” a nurse in North Carolina.
Any other cool web tricks?
