GreekChat.com Forums  

  #1  
Old 04-28-2008, 02:56 PM
naraht
GreekChat Member
 
Join Date: Apr 2001
Location: Rockville,MD,USA
Posts: 3,543
Automation of download of chapter list page

I've written a script that downloads the information on the
http://www.apo.org/show/How_to_Start...r/Chapter_List page (what you get after clicking Go) and boils it down to a vertical-bar-delimited file suitable for importing into spreadsheets and databases. It should work on any Unix/Linux/Mac machine that has netcat installed; depending on the machine, the command may be called nc or netcat.

#!/bin/bash
# Output file is named for today's date, e.g. apo080428.
out=apo$(date '+%y%m%d')
# Send the request, keep the table, put one row on each line, then strip the markup.
nc www.apo.org 80 < nc.apo.in |
  grep "<table>" |
  sed -e 's#</tr><tr>#</tr>+<tr>#g' | tr "+" "\n" |
  grep -v colspan | grep -v "td width=" |
  sed -e 's/Send Email//g' -e 's#</b><br>#|#g' -e 's#<br>Region:#|Region:#g' -e 's#<i>#|#g' |
  perl -pe 's/<[^>]*>//g' > "$out"
# Tally how often each value of field 4 occurs.
cut -f 4 -d '|' "$out" | sort | uniq -c > "$out.count"

Every line of the pipeline ends with a |, so bash treats those line breaks as continuations. Any other line break you see is just wordwrap and should not be in the program.

In addition, a file named nc.apo.in needs to exist in the directory you run the script from. It contains:

POST /show/How_to_Start_a_Chapter/Chapter_List HTTP/1.0
Content-Length: 71
Content-Type: application/x-www-form-urlencoded
User-Agent: Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1)
Host: www.apo.org

bystatus=0&byregion=0&bysection=0&bycity=&bystate=0&bysort=S&submit=Go
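
If you would rather not retype the request file by hand, here is a rough sketch that generates it, so the Content-Length header always matches the body. It assumes the form body has no embedded spaces and is sent with a single trailing newline (70 characters plus the newline gives the 71 above). The variable names body and length are mine, not part of the original script.

```shell
#!/bin/bash
# Sketch: build nc.apo.in with a computed Content-Length.
# Assumption: the body contains no spaces and ends with exactly one newline.
body='bystatus=0&byregion=0&bysection=0&bycity=&bystate=0&bysort=S&submit=Go'

# ${#body} is the character count; +1 accounts for the trailing newline.
length=$(( ${#body} + 1 ))

# Unquoted EOF so $length and $body are expanded inside the here-document.
cat > nc.apo.in <<EOF
POST /show/How_to_Start_a_Chapter/Chapter_List HTTP/1.0
Content-Length: $length
Content-Type: application/x-www-form-urlencoded
User-Agent: Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1)
Host: www.apo.org

$body
EOF
```

If the form fields ever change, editing the body line is enough; the header is recalculated on the next run.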
__________________
Because "undergrads, please abandon your national policies and make something up" will end well --KnightShadow

Last edited by naraht; 04-28-2008 at 03:20 PM. Reason: added other file.
All times are GMT -4.