Repository contents (every entry: "first commit", 2022-09-13 17:15:14 +02:00) :
adult
agressif
arjel
associations_religieuses
astrology
audio-video
bank
bitcoin
blog
celebrity
chat
child
cleaning
cooking
cryptojacking
dangerous_material
dating
ddos
dialer
doh
download
drogue
educational_games
examen_pix
exceptions_liste_bu
filehosting
financial
forums
gambling
games
hacking
jobsearch
lingerie
liste_blanche
liste_bu
malware
manga
marketingware
mixed_adult
mobile-phone
phishing
press
publicite
radio
reaffected
redirector
remote-control
sect
sexual_education
shopping
shortener
social_networks
special
sports
stalkerware
strict_redirector
strong_redirector
translation
tricheur
update
vpn
warez
webmail
ads
aggressive
cc-by-sa-4-0.pdf
drugs
global_usage
LICENSE.pdf
mail
porn
proxy
README
violence

These files are contributions for the squidguard software
http://www.squidguard.org

Licence :
---------
These files are licensed under Creative Commons :
http://creativecommons.org/licenses/by-sa/2.0/

Information :
-------------
All information is available at
	http://cri.univ-tlse1.fr/blacklists/

databases :
-----------
Main database :

blacklists.tar.gz       is the compilation of all the databases described below.

adult.tar.gz		is a list of adult sites. It is based on
                        - squidguard Robot
                        - external databases
                        - personal additions
                        - external additions
                               thanks to
                                        Cedric Foll
                                        David Garroux du CARIP de Lyon
                                        Deckert Florian <net74@sopra.com>
                                        Francesco Mascaro
					Hans Musil
                                        Jago <jago27@usa.net>
                                        Kris Carlier <CARLIER.K@JS.MIL.BE>
                                        Mark Bizzell <bizzell@usq.edu.au>
                                        Mark Kool
                                        Michel Roiron <webmaster@cfa.fr>
                                        Philippe Ferreira <ferreira@atalante-acd.com>
                                        Rick Matthews
					Rogério Pinheiro da Silva (Prodesan)
                                        Sylvain Vincent <sylvain.vincent@cfsa-aftec.com>
                                        Symon Aked
                                        Todd Sieland-Peterson

Last version Monday 12 September 2022 with 4501549 domains and 124316 urls : 19072 Kb.
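
The domain and url totals quoted above can be checked locally with a few
lines of Python. This is only a minimal sketch: it assumes that an unpacked
category is a directory holding plain-text files named "domains" and "urls"
with one entry per line, and the function name count_entries is hypothetical
(it is not one of the scripts shipped with these lists).

```python
from pathlib import Path


def count_entries(category_dir):
    """Count non-empty lines in the 'domains' and 'urls' files of one category.

    Assumption: the category directory contains plain-text files named
    'domains' and 'urls', one entry per line. A missing file counts as 0.
    """
    counts = {}
    for name in ("domains", "urls"):
        path = Path(category_dir) / name
        if path.is_file():
            with path.open(encoding="utf-8", errors="replace") as handle:
                counts[name] = sum(1 for line in handle if line.strip())
        else:
            counts[name] = 0
    return counts
```

For example, count_entries("adult") returns a dictionary with the keys
"domains" and "urls" for the unpacked adult category.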
			

OTHER :
agressif.tar.gz		is a list of aggressive sites (xenophobic, ...)
audio-video.tar.gz	audio and video sites
blog.tar.gz		is a list of blogs
drogue.tar.gz		is a list of drug-related sites
forums.tar.gz		is a list of common public mail and chat
                        thanks to
				Arnaud DA COSTA <Arnaud.DA-COSTA@u-bourgogne.fr>
gambling.tar.gz		gambling
games.tar.gz		internet games (flash, online games, ..) 
			thanks to
				Yann Cézard (CRI - Université de Pau et des Pays de l'Adour)  

hacking.tar.gz		
liste_bu.tar.gz		is designed to be used as a FRENCH whitelist for libraries
                        thanks to
                                 Service de Documentation de l'Universite Toulouse 1
mobile-phone.tar.gz	is a list of sites dedicated to mobile phones
phishing.tar.gz		is a list of phishing sites (sourced from surbl.org)
publicite.tar.gz	is a list of banner and ad sites
                        thanks to
                                 Jose Pires <pires@nantaise-habitations.fr>
radio.tar.gz		is a small list of radio sites (to prevent radio listening)
redirector.tar.gz	common redirector to bypass filtering
strict_redirector.tar.gz	is like the previous one, plus some useful but "maybe dangerous" sites
                        (cached sites in google, images.google.fr, alltheweb.com and images, ...)
strong_redirector.tar.gz	is like the previous one, with specific "expressions". It blocks only
			some terms in a "google search".
tricheur.tar.gz		is a database of sites designed for cheating during exams
warez.tar.gz		is a list of warez sites
webmail.tar.gz		is a list of common webmail
			
			----------------------------
       OF COURSE, mistakes may exist. If you find any, send me an email :
                 fabrice.prigent@univ-tlse1.fr
	

scripts (beta stage) :
----------------------
blocked.src		is an example of a blocking page (see also squidguard.cgi)
ajout_squidguard.sh	is a script to merge databases
recherche_porno.pl	these two scripts search for "inappropriate urls" in an access.log
recherche_porno.sh	
scripts.tar.gz		all scripts in tar form
taille_categorie_squid.pl	prints the percentage of some categories
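
To illustrate the idea behind searching an access.log for listed sites, here
is a minimal Python sketch. It is not the logic of recherche_porno.pl or
recherche_porno.sh, which is not described here. It assumes the native Squid
log format, where the requested URL is the 7th whitespace-separated field,
and a "domains" file with one domain per line. All function names below are
hypothetical.

```python
from urllib.parse import urlsplit


def load_domains(path):
    """Read one domain per line into a set, skipping blanks and '#' comments."""
    with open(path, encoding="utf-8", errors="replace") as handle:
        return {
            line.strip().lower()
            for line in handle
            if line.strip() and not line.startswith("#")
        }


def host_of(url):
    """Return the lowercase host of 'http://host/path' or of a CONNECT-style 'host:port'."""
    if "://" not in url:
        url = "//" + url  # lets urlsplit treat 'host:port' as a network location
    try:
        return (urlsplit(url).hostname or "").lower()
    except ValueError:
        return ""  # malformed URL: treat as no host


def is_listed(host, domains):
    """True if the host or one of its parent domains is in the set."""
    labels = host.split(".")
    return any(".".join(labels[i:]) in domains for i in range(len(labels)))


def find_listed_requests(log_lines, domains, url_field=6):
    """Yield (host, url) for each log line whose URL host is listed.

    Assumption: url_field=6 matches the native Squid access.log layout
    (time, elapsed, client, code/status, bytes, method, URL, ...).
    """
    for line in log_lines:
        fields = line.split()
        if len(fields) <= url_field:
            continue  # too short to be a native-format log line
        url = fields[url_field]
        host = host_of(url)
        if host and is_listed(host, domains):
            yield host, url
```

A typical use would be to call load_domains("adult/domains") once, then pass
an open access.log file object to find_listed_requests and print each hit.
Matching on whole labels means "notbad.example" does not match a listed
"bad.example", while "www.bad.example" does.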