focus for this week: aspects of neuroscience
(popular pages, total: 2,361, >2 mio views)

exploring StatMediaWiki - collect and aggregate information about your wiki

From I ask questions
Jump to: navigation, search

Contents


There are 2 versions of the StatMediaWiki:

  • classic (which is stable)
  • interactive (which is under development)


2nd try

The interactive version 0.1.7 (svn checkout today) works now for me better (some exceptions and so still happen - later more - but I can see more now).
some short facts:

  • xml file tar.gz: 15.9 MB
  • sqlite3 DB: 14.3 MB (if the size is too low - see below - probably it's best to stop, instead waiting so long)

It took some hours until the preprocessor finished the wiki's

  • 1920 pages
  • >7000 revisions

some examples:

1st try (5 months ago)

and both do not work for me yet (see below) :-(

  • The classic version (StatMediaWiki 1.1) due to this bug

interactive version

the menu Analyser in StatMediaWiki - interactive version

It took 50 mins to read:


exported XML file's size: 49.2 MB

One needs to load this xml into the Preprocessor

which creates than a sqlite3 database (size for above xml: 892 kb)

Unfortunately most of the graphs/functions in the menu: Analyser

  • did not work (correct) or
  • were not available: Feature not implemented for the moment. Contributions are welcome.

(screenshots of that menu are viewable on the right side, to make you interested to try this software also :-) )

some examples:

  • Analyser > Global > Pareto:
    • Exception in Tkinter callback - TypeError: unsupported operand type(s) for /: 'tuple' and 'float'
  • Analyser > User-by-User > Activity > All
    • Exception in Tkinter callback - OperationalError: no such column: rev_username
  • Analyser > Page-by-Page > Activity > All
    • just shows 1 page: Help:MediaWiki

Later I'll file some more bug reports (info: this is just a post to inform other people with wikis + share experiences on this)

The "Analyser > User edits network" seems interesting

menu: Analyser > Global summary

As you can see the values are not correct - need to find out why, I checked the xml file's integrity + it was ok.

Pages 1
Revisions 4345 (total)
4322 (by registered users)
23 (by unregistered users)
Revs/pag 4345.00
Users 16 (registered users)
5 (unregistered users)
Revs/user 270.12 (by registered users)
4.60 (by unregistered users)
First edit 2009-03-07 11:05:21 (User:MediaWiki default)
Last edit 2011-08-18 22:01:28 (User:Erkan Yilmaz)
Lifespan 894 days
Links 0 (internal links)
0 (external links)
0 (interwiki links)
Sections 0
Edit summary usage 4066 (94.08%) by registered users
20 (86.96%) by unregistered users
4086 (94.04%) by both
  • Perhaps try to get the xml file with a newer version?
I retried with newest version of the dumpgenerator.py (revision 225 from Jul 15, 2011 - I've also done more edits in the wiki since I ran the interactive version) but still same problem (e.g. only 1 page shown)
the interactive version tells me:
Parsing skilledtestscom_wiki-20110819-history.xml
Reading XML dump...
Total revisions [4376], correctly inserted [4363], errors [13], time [10 secs, 0.178590 minutes]
GENERATED PAGE TABLE: 1
GENERATED USER TABLE: 21
Parsed skilledtestscom_wiki-20110819-history.xml OK!


software used

  • StatMediaWiki Interactive version 0.1.3 (svn version, checked out revision 383)
  • MediaWiki 1.17.0
  • Python 2.6.6 (r266:84292, Sep 15 2010, 16:22:56)
  • Ubuntu 10.10

see also

other wiki analyzers

Mediawiki extensions

Personal tools