Can Online Plagiarism Be Stopped?

07/13/10

Permalink 07:25:16 pm, by jfiore English (CA)
Categories: Opinion

Can Online Plagiarism Be Stopped?

I stumbled on a tweet earlier this afternoon which really caught my attention. I was reserved about blogging on the matter, but really felt the message to be too important to not bring it out in the open. It started with a tweet from Matt Ridings (@techgeurilla) which drew attention to a blog site which was scraping Olivier Blanchard's (@thebrandbuilder) blog post:

Can Online Plagiarizing Be Stopped?

I don't want to draw any more attention than needed to the scraped post - suffice it to say that I found the idea of a company copying a post written by Olivier word for word on their own blog to be beyond belief. And as unbelievable as it may sound, it appears they have copied more than one of Olivier's posts.

Plagiarism by Any Other Name

In my second year of University, I decided to specialize in something so I chose to do a major in Public Policy and Administration. It was a fancier way of saying I was going to major in Political Science. The introductory core course was taught by a tough but fair professor. She was extremely bright, and was fast approaching tenure at the University.

It was nearing the end of the school year, and she had been prepping us for weeks on a paper that was going to make up a huge part of our grade. Because it was a core course, we needed to score a B or higher in order to stay in the program, and while I probably made it sound like a bird course, it was actually a fairly competitive program, only second to the University's business program.

Many early warnings and instructional sessions took place leading up to that important paper - everything from the "how to's" of writing an essay right through to proper attribution and sourcing of reference material. York University defines plagiarism as "the practice of claiming, or implying, original authorship of (or incorporating material from) someone else's written or creative work, in whole or in part, into one's own without adequate acknowledgement."

I'll never forget the feeling when I entered the classroom and watched/heard the department head giving a very serious and stern lecture the day we were expecting our essays back. 70% of the class had been caught for plagiarizing - that amounted to close to 200 students. I was panicking so hard that day, trying desperately to recall what I had written, and hoping I wasn't one of those students. Thankfully, I wasn't, and many not only failed the course, but were facing academic probabtion due to the severity of the offense. I'll return back to the rest of this story later in the post.

Uniqueness and Creativity

With the amount of social media monitoring we perform, it isn't unusual to stumble upon scraped posts originating from servers sprawled all over Eastern European countries. In cases such as these, copyright on the Web can be tricky business because laws written in one country carry little or no weight in other parts of the world. This particular incident is a little different from the kind of blog scraping we see, and that's because it points to a marketing and design agency based out of Portland, Oregon. And yet despite the fact that it's a company on American soil, it would seem that many of the back and forth chats with Olivier and Matt earlier, online scraping and/or plagiarizing isn't anything new and it's happened to quite a few other well known bloggers.

Flattery or Dishonesty

As a sideline observer to this whole notion of "imitation as the best form of flattery" I must admit that I find nothing flattering about it. Outside of the ethical wrongdoing, I'd have to think that the worst part of being caught plagiarizing is that you're really admitting you don't have anything unique or creative to contribute. But how is an offender made to realize that ripping off material and claiming it as their own is unacceptable and does more reputational harm than good?

It appears in this case, the best thing to do would be to communicate with the company, requesting that they either alter their approach or altogether stop scraping content that doesn't rightfully belong to them. I've read suggestions about reporting scraping sites (i.e if they use AdSense, reporting it to Google) or alerting the hosting company they use about their activities. I'll leave the litigious aspects of copyright infringement to the lawyers.

Make Waves and Drown

Currently, a search on the title of Olivier's post shows 6 of the top 10 results referring to the original post. This 60/40 split makes me wonder whether situations like this hurt bloggers search rankings. Whether or not they are aware of the scraping, I'd have to believe that content readers will figure it out most of the time for themselves. But with all the changes in social media influencing search rankings, and Twitter spam getting its fair share of the credit that in the past would more appropriately reward blogger originality and creativity, is there an SERM risk in allowing incidents like this to go on without penalty? And what, if anything, can be done to stop it from happening?

Going back to the story about the students from my University class who were caught plagiarizing, they didn't fare all that well after being caught. In fact, there were rumors that many of the students parents ended-up complaining about the situation, which unfortunately led to the professor being dismissed from the program, and not getting tenure. In retrospect, it seems entirely unjust that the professor was caught in the crossfire, especially when all she wanted to do was make sure our academic prosperity was founded on principles of honesty and integrity. The experience left a permanent impression about the consequences of plagiarism, and this most recent incident with copying Olivier's blog post leaves me wondering whether the social Web's evolution will be built on unearned reputations, false claims of ownership, or rather, the kind of imagination and creativity which will give it dynamic staying power, and eventually rids scraping and plagiarism for good.

Post on Twitter


Site Meter

Comments, Pingbacks:

Comment from: Mark W. Schaefer [Visitor] Email · http://www.businessesgrow.com
No, we can't stop it. I don't even know how to keep up with it. I would say I have a post completely ripped off and posted without attribution 1-2 times a month. But those are the ones I find out about. Most of the time I stumble upon something suspicious and it has often been translated into another language.

I usually write a polite note at least asking for attibution. Sometimes it happens sometimes it doesn't.

And by the way, here's a story about how famous film maker Michale Moore ripped of and took credit for a local news story: http://shar.es/mBgQu

The integrity and fairness of the Internet is permanently broken.

Thanks for the great post Joseph.
PermalinkPermalink 07/13/10 @ 23:24
Comment from: Adam Budd [Visitor] Email · http://adambudd.wordpress.com
I had something similar happen, I wrote a blog about websites like Twitter and Facebook being a "social media highway"... someone who commented on it and liked it a lot ended up making their name on Twitter "SocialMediaHWY" or something to that effect.

Not that I don't appreciate her reading, but I was kind of perturbed when I realized that.
PermalinkPermalink 07/13/10 @ 23:33
Comment from: autom [Visitor] Email · http://autom.x.iabc.com
stopped? hmm .. sadly i doubt it. for some there will always be an urge to take shortcuts at the risk of their own reputation and credibility

perhaps they don't care or quite simply have no imagination or are plain desperate

what could potentially be disconcerting is if say a given majority would view this online behaviour as perfectly acceptable and chalk it up to some lame excuse about how the seemingly gratuitous 'sharing' and 'openness' of the web "naturally engender" these tendencies

i would not be surprised if some actually do advocate that view .. while thumbing a ride to #area53 ;)

PermalinkPermalink 07/14/10 @ 00:30
Comment from: jfiore [Member] Email
@Mark, thanks for stopping by and and commenting. I thought you might be going through this problem due to the popularity of your blog. It is rather unfortunate that it's become so difficult for bloggers to track this type of thing. Thanks also for link sharing that article - more proof that things may be broken beyond repair.

@Adam, thanks for stopping-by and sharing your story. I find myself worrying about this sometimes myself. I read so much content that I wonder if I indeliberately am recalling thoughts, or even a coined term I've read somewhere. The rule is that if I think I might have seen/read it somewhere else, and can't recall the source, I just won't use it.

@Autom I agree. And it's funny that you mention the possibility of people taking this approach at wholesale. While writing this yesterday, I thought about how we automatically assume role of conservator with Twitter's ecosystem - whether it be calling out someone who spams, or making sure attribution of links/RT's is done using proper form and etiquette. Is it because the 140 character messaging makes it easier to manage? Have we allowed content from the other moving parts of the social Web to evolve and become too unwieldy or too difficult to manage, co-ordinate and/or preserve?

After Thoughts

I've noticed that many News sites have implemented javascript code which automatically alters any cutting and pasting done from any of their content. Meaning, if someone were to try to copy the title, short snippet, or article in its entirety, this code automatically inserts a link when it is pasted. This is one way to enforce attribution, and if the scraper uses the lame excuse that the scraping is being done by some automated tool, then there should be no reason for the attribution link not to appear when it is pasted on their blog or other social media platform.

I believe this code is part of a content management solution offered by a company called Demand Media. The thing I like about this most is that if you wanted to control the way your blog content is shared, you could use this code to force a "teaser" type of snippet, which includes a "Read More" link that will direct readers stumbling on the post in other parts of the social Web to your post. From a copyright standpoint, if someone decides they still want to rip off the entirety of your article, even though you'd prefer they only take a few paragraphs and read more link from the site the content originates, then if the content still gets mirrored in its entirety, you'll know the only way they pulled it off was to type it out. This may change things from the point of view of disabling automated scraping tools on terms that are agreeable with the content creators and perhaps even help enforce copyright.

I also don't imagine producing this type of functionality for blog sites will be too difficult a task - in fact it seems that it's the perfect time to pressure Wordpress and other blog software developers to produce plugins to help bloggers safeguard their work and nip the scraping/plagiarism problem at the bud.
PermalinkPermalink 07/14/10 @ 09:14

This post has 1 feedback awaiting moderation...

Leave a comment:

Your email address will not be displayed on this site.
Your URL will be displayed.

Allowed XHTML tags: <p, ul, ol, li, dl, dt, dd, address, blockquote, ins, del, span, bdo, br, em, strong, dfn, code, samp, kdb, var, cite, abbr, acronym, q, sub, sup, tt, i, b, big, small>
(Line breaks become <br />)
(Set cookies for name, email and url)
(Allow users to contact you through a message form (your email will NOT be displayed.))

RepuTrack™

RepuTrack™ monitors online media from Web sites, blogs, message boards, forums, chatrooms, microblogs, social networking sites, video and images worldwide.

To schedule a free online-demonstration of RepuTrack™, click here.

RepuTrack™ - Online Reputation Monitoring

February 2012
Sun Mon Tue Wed Thu Fri Sat
 << <   > >>
      1 2 3 4
5 6 7 8 9 10 11
12 13 14 15 16 17 18
19 20 21 22 23 24 25
26 27 28 29      

Search

Categories

Misc

XML Feeds

What is RSS?

Who's Online?

  • Guest Users: 3